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ABSTRACT 


The purpose of this study is to develop microprogranm 
optimization strategies which include an analysis of 
alternate action forms in the deletion of nonessential 
actions. Alternate actions (mMicro-operations) are determined 
by the identification of ali possible inputs and outputs for 
each action in a program node. The examination of alternate 
action forms provides a significant increase in the 
opportunities for nonessential action deletion since the 
program node may be restructured to force actions to assume 
a nonessential status. Many action form combinations may be 
used to generate different program node structures. It is 
necessary to identify a program node structure which results 
in the greatest number of action deletions. fTfhis optimal 
structure is not necessarily unique. 

The study poses and solves three major problems: The 
identification of alternate and equivalent action forms; the 
definition of deletion strategies which include an analysis 
oi alternate and equivalent action forms; and, the 
determination of a set of deletion strategies which may be 
employed in the optimal deletion of nonessential actions. 

The deletion of nonessential actions is directly 
applicable to both Microprogram Optimization and General 
Compiler Optimization. Therefore, the results of this study 


may be shared by both disciplines. 
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CHAPDER SL 


INTRODUCTION 


fhe development of high-level microprogramming languages 
and their compilers has led to microprogram optimization 
research. Microprogram Storage cost and frequency of 
execution are such that optimization is of Critical 


importance. 


Notable success has been attained in microprogranm 
optimization by augmenting existing software compiler 
techniques, with ones that reflect the differences between 
software and firmware (micro-code); however, the basis of 
Optimization remains the same for both: The identification 
and deletion of nonessential actions. Firmware actions are 
mMicro-operations which consist of one or more input memory 
sub-units, a primitive operation, and an output memory sub- 


unit. 


Nonessential actions (micro-operations) are divided into 
three general classes: Negated, redundant, and parallel. The 
purpose of this study is to develop optimization strategies 
which include an analysis of alternate input and output 
specifications for each action. Alternate and equivalent 
action forms are to be employed to identify all possible 
nonessential actions. Previous deletion strategies have 
examined only the source form of each action; a ae 


examination is given only to input and output memory sub- 
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units specified at the source-level of each action. The 
deietion of all nonessential actions requires the 
identification and analysis of all possible forms of each 


action. 


A deterministic algorithm is developed for the 
identification of all possible alternate and equivalent 
action forms; and then, optimization strategies are defined 
Which include an analysis for ali possible forms of each 
action. With analysis given to all possible forms, the 
definition of nonessential actions is expanded to be 


inclusive of the following actions. 


1. Redundant actions: Those actions whose outputs are 
predictable and currently availiable from other equivalent 
action execution. Equivalent actions are those actions whose 


outputs are identical with respect to value. 


2. Parallel actions: Those actions whose outputs may be 


defined by other equivalent actions. 


3. Negated actions: Those actions whose outputs need 


never be used. 


The approach used in this study is to identify alternate 
action forms by the analysis of equivalently defined memory 
sub-units. Deletion strategies are based upon an examination 
of alternate action forms which may be employed to force 
actions to conform to a nonessential status. Limitations of 


the approach are that run-time statistics are not used in 
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the determination of equivalent memory sub-units; analysis 


is performed only upon progran nodes, not regions; and, 


branch-point analysis is not performed. 


The literature survey, Chapter 2, presents a review of 
microprogram optimization techniques as well as a general 
review of miccroprogramming. The general review of 
microprogramming inciudes a brief history, philosophies, and 


languages. 


There exists a wide variety of microprogrammable 
machines and microprogramming languages; therefore, this 
study requires the proposal of a machine description and 
microprogram format which is sufficiently universal to avoid 
restrictive conclusions. Chapter 3 presents the machine 
description and microprogran format along with a detailed 


discussion of the study framework. 


A major problem dealt with in the study is the 
identification of all alternate and equivalent action forms; 
i.e., given N sequential actions, all possible inputs and 
outputs must be identified. The solution to the problem is 
given in Chapter 4 in the form of an algorithm and proof of 


its correctness. 


When alternate and equivalent action forms are 
determined, it is possible to force actions to assume a 
nonessential status. This is accomplished by the assignment 


of alternate input and output memory sub-units. A minimal 
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set of conditions are met, in order for an action to assume 
a nonessential status. These conditions, defined in Chapter 
5, are based upon a analysis of feasible alternate action 


forms and are applied to actions independently. 


From the conditions for deletion candidacy, it can be 
seen that there may exist more than one strategy for the 
deletion of an action. Furthermore, the selection of a 
particular strategy is a dependent event, and may effect the 
deletion candidacy of other actions; therefore, it is 
necessary to define ail possible deletion strategies and all 
conflicting deletion strategies. Chapter 6 defines tie 
strategies which may exist for the deletion of an action; 
the -conflicting strategies; and, the criteria of optimality 


in the selection of strategies. 


Conclusions and recommendations for further research are 
offered in Chapter 7. Findings of the study include the 
following: The set of redundant actions is covered by the 
set of parallel actions; the algorithm which identifies 
alternate action forms also identifies all cases where 
parallel hardware fan-out may be exploited; an examination 
of adjacent memory sub-unit definitions is sufficient to 
identify all negated definitions; and, order-isomorphisms 
may exist in the analysis of deletion candidates, but are 


identifiable and avoidable. 
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The deletion of nonessential actions is directly 
applicable to both Microprogram Optimization and General 
Compiler Optimization. Therefore, the results of this study 
may be shared by both “dasciplines. In “this” study, 
optimization strategies are developed for microprograms, 


rather than software programs, for the following reasons. 


1. Generally, micro-routines have a high frequency of 


execution. 


2en Microprogrameestores (contrel?*stores) are nore 


expensive than software program stores (main memories). 


3. Without loss of generality, subroutine calls and 


indirect addressing may be omitted from the study. 


CHAPTER II 


LITERATURE SURVEY 


2.1 Introduction 


This literature survey is designed to present a _ brief 
historical review of microprogramming along with a 
discussion of related research. In Chapter 3, a microprogran 
format 1S presented which is intended to include many 
aie carers microprogramming languages and philosophies. In 
Section le discussion is given to different 
microprogramming philosophies, and Section 2.4 provides a 
description of language categories. Section 2.5 is devoted 
to a review of recent microprogram compiler optimization 


research. 
2.2 Brief History of Microprogramming 


Since its definition by M. V. Wilkes in 1951 [53], 
microprogramming has enjoyed increased popularity as a 
viable alternative to hard-wired computer control. 
Miccroprogramming research has generally been directed toward 
two major areas: application and language development. 
Examples of applications would include the development of 
virtual machine emulators such as a virtual APL machine [1] 
and a virtual FORTRAN machine [38]. Examples of language 


developments are presented in Section 2.4. 
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Microprogramming was developed to provide a systematic 
and orderly approach to the design of control systems for 
digital computers. The control system is described in terms 
of register transfers within the processor; transfers may 


take place through an adder or other logic circuitry. 


The microprogram consists of a series of steps called 
micro-instructions which reside in a control storage device. 
As explained by Wilkes, the control storage device is made 
up of two READ-ONLY diode matrices, matrix A and matrix B. 
The A matrix routes the logic to be performed by the micro- 
instruction, and the B matrix is used to select the next 
micro-instruction for execution. A timing pulse along with a 
micro-instruction enters a decoding tree, and the 
appropriate connections are selected within matrix A. The B 
matrix output is then routed, via a delay circuit, to the 
micro-instruction address register; thus, the next micro- 
instruction is selected. Figure 2.1 displays the original 


model as defined by Wilkes. 
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Figure 2.1 Wilkes' Original Microprogramming Model[ 52] 


The use of microprogramming exclusively as a design tool 
continued untii the mid 1960's. Computers began to emerge 
which were designed with microprogramming capacities above 
the normal requirements of the host machine's instruction 
set. With the additional control storage capacity, the 
concept of using microprograms to support software became 
feasible. The term "firmware" was coined by Opler{43] in 


1967 to describe this interaction of software and hardware. 
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User interest in microprogramming gained additional 
stimulation with the use of RE AD-WRITE control 
memories(6,24]. Writable control stores allow easy storage 
and alteration of microprograms. With the ex panded 
microprogram capacity and writable control storage, both 
vendor and user became interested in eaSier ways in which to 
express microprograms. This led to the development of many 


different microprogramming languages. 


Several languages and their compilers have been 
developed to facilitate the expression of microprograms. 
Languages vary greatly in both their objective and 
structure, with some languages expressed in an assembler- 
like format [55], while others assume a compiled macro 
expansion structure [13]. Two major reasons for the wide 
variety of language types are the variety of applications 


and the variety of microprogramming philosophies. 


A study performed by Redfield{47] on the different 
technigues of microprogramming centered around three major 
microprogramming philosophies: monophase versus polyphase, 


parallel versus serial, and encoded versus highly encoded. 


ersus Polyphase Microprogramming 


2.3.1 Monophase 


Monophase and polyphase microprogramming (also called 
vertical and horizontal) are terms which refer to the 


measure of time for which a micro-instruction is active 
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during the execution of a particular micro-routine. A micro- 
routine is defined as the set of micro-instructions required 
to implement a specific operation, such as a "branch and 
link" (BAL) machine-language instruction. Ina monophase 
environment, the micro-instruction is active only for the 
Clock cycle in which it is initiated; each micro-instruction 
causes a single simultaneous issue of control signals. In a 
polyphase microprogram, the micro-instruction generates 
control signals which are active over more than one clock 


cycle. 


A strictly monophase approach may require impractical 
numbers of mMicro-instructions; thus causing excessive 
control storage fetches. However, a strictly polyphase 
approach may require an excessively large micro-instruction 


word. 
2.3.2 Parallel Versus Serial Microprogramming 


Parallel and serial microprogramming are terms which 
refer to the scheme used to seiect the next micro- 
instruction. The parallel approach utilizes a “best guess" 
philosophy by the selection of the next micro-instruction 
during the execution of the current micro-instruction. fhe 
selection may be made in a sequential manner, or it may be 
based upon data available from the micro-instruction which 
immediately preceded the currently active micro-instruction. 
In a serial approach, the next micro-instruction is selected 


upon the completion of the one currently being executed. 
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Micro-instruction word formats lie on a continuum scale 
between direct control (minimally encoded) and highiy 
encoded control. With the direct control scheme, each micro- 
instruction contains fields which control many different 
data paths within the processor unit. For example, a direct 
controi micro-instruction, such as that proposed by Tucker 
and Flynn[51], may contain fields which control mnicro- 
instruction sequencing, masking, Shifting, and adder 
operations (micro-operations). Computers which currently use 
a direct control micro-instruction word format include the 
RCA Spectra 70 and the IBM System/360 models 20, 25, 30, 40, 


S0;7,U65,;tand 8S. 


A highly encoded micro-instruction word format is 
analogous to the format of a one or two address machine- 
language instruction. The micro-instruction contains an 
operation code, which is decoded, and one or more operands 
which participate in the micro-operation execution. Two 
machines which use the highly encoded scheme are the 


Honeywell H-4200 and the Standard Computer IC-7OQOOE. 


In viewing each end of the continuous scale, it can be 
seen that the direct control approach requires a larger 
control storage word and is designed to exploit parallel 
data path operations; while the highly encoded approach 
requires more instructions and is generally more flexible 


with respect to algorithm expression. A further analysis of 
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the relative efficiencies of these two formats is presented 


in separate articles by Jakolat[25] ana Kurpanek[ 29 ]. 


The development of symbolic microprogramming languages 
paralleled the use of microprograms to augment software 
routines. Along with this development, interest has been 
generated in different micro-instruction word formats. The 
next section describes the two major categories of symbolic 


microprogramming languages. 
2-4 Language Categories 


Many languages for the expression of microprograms have 
been developed since 1967: Some draw heavily upon languages 
which have been developed expressly for the description of 
machine logic, such as Schlaeppi's LOTIS (499 3 
Microprogramming languages may generaily be classified into 
two broad categories: Those which assume a particular 
hardware configuration for the host machine; and, those 


which include facilities for defining the host machine. 


IBM's Control Automation System (CAS), designed for the 
System/360 line, is an example of the first category [24]. 
Microprograms are written aS a series of nicro-instruction 
ies which specify the action to be taken by each micro- 
instruction. These boxes are connected in a flow chart 
Manner to describe the microprogram. The information in each 


box is translated into a micro-instruction; then, the micro- 
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instructions are converted, after simulation, into control 


storage bit patterns. 


The format and content of micro-instructions change when 
microprogramming different models within the Systen/360 
line; however, each micro-instruction box generally contains 
fields for arithmetic statements, local and main storage 
addressing, data transfer specifications, and control 
storage addressing. The micro-instruction box for the 


System/360 model 40 is given in Figure 2.2. 
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Figure 2.2 System/360 Model 40 Micro-instruction Box 


The restrictive hature of machine-dependent 
microprogramming languages provides the primary stimulus for 
the development of languages which include facilities for 
machine description. The following section describes a 


language which has machine description capabilities. 
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2.4.2 Young's Lanquage 


A Mmicroprogram simulator developed by Steven Young of 
RCA is an example from the second language category [55]. 
The simulator includes facilities for the definition of 
machine elements, micro-operations, and micro-instructions. 
The system, ALSIM, is designed to allow machine definition 
and microprogramming of a wide range of machine 


configurations. 


ALSIM is a highly formatted, descriptive language. 
Generally, the system is divided into the following three 


mutually dependent sections. 


1. Machine description section: Defines the machine in 
terms of registers, busses, fields, functional memory, main 


memory, and processor functions. 


Ze Micro-operation section: Defines the micro- 
instruction format, the order of execution for functions 
within the micro-instruction, and hardwired or emulated 


primitive functions. 


a3 Micro-instruction section: Consists of micro- 
instructions coded in the format defined in the micro- 


operation section. 
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The definition of machine elements is accomplished 
through a specification of storage registers, data paths, 
memories, and basic machine functions. The general form of a 
machine element definition is 

DEFINE name, key word, qualifiers 

where "key word" specifies the type of machine element to be 
defined. For example, the following machine element 
definitions declare a 32-bit adder (ADD), register (A), and 
bus (TOT): 

DEFINE A,REG,32 

DEFINE TOT,BUS,32 

DEFINE ADD, ADDER,32,0,0V,31,C,C5S,0 
Qualification for the adder includes the specification of an 


overflow register and bit, along with carry registers. 


The micro-operation section is used to define the micro- 
instruction format, execution order of micro-instruction 
functions, test conditions, and primitive functions. fhe 
micro-instruction format is specified by use of the MAP 
Statement. For example, 

| MAP A,T,F,F,F¥,G,G 
specifies that the format of the micro-instruction is to be 
first, current address(A); next, a test condition(T); then, 


three functions(F); and lastly, two branch addresses(G). 


16 


The execution order of the functions in the micro- 
instruction is specified on the EOM statement. The EOM 
statement specities when functions are to be performed; when 
tests are to be evaluated; and, which addresses are to be 


selected. 


EQN statements are used to define tests and assign names 
to them. These names are used in the code specified in the 
micro-instruction section. For example, 

EQN AzTOT ACOMPSUM 


assigns the name ACOMPSUM to the comparison of A and TOT. 


Primitive functions are defined by the use of the FUN 
statement. The FUN statement defines operations to be 
performed upon machine elements. The operations are 
specified in terms of basic logic functions such as AND, OR, 
and TR (transfer). To define a function which would transfer 
register A to bus TOT, the following FUN statement may be 
used. 


FUN TRA TR,TOT,A 


The micro-instruction section is used to write the 
micro-routines. Micro-instructions are coded in the format 
of the MAP statement; with the execution order as given on 
the EOM statement; the mnemonic test names aS given on EQN 
statements; and, the functions as defined on the FUN 


statements. 
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In summary, the model provides an effective means for 
the description of a wide range of machine configurations. 
Furthermore, from the standpoint of assembler 
microprogramming, ALSIM is easy to use and follows a logical 
flow Of machine declaration, micro-instruction format 


description, and micro-instruction action. 


ALSIM and IBM's CAS have been presented to provide 
examples of different microprogramming language approaches. 
It can be seen that the two languages vary greatly in toth 
their structure and programming requirements. However, the 
micro-instruction level of all microprogramming languages is 
functionally the same: Primitive hardware functions are 
invoked for the purpose of defining or changing the state of 
machine elements. It is to this micro-instruction ievel that 
optimization strategies for microprograms are developed in 


this study. 


2.5 Microprogram Optimization 


The development of microprogram optimization strategies 
has made extensive use of techniques developed for software 
compiler optimization. Many techniques described in a text 
by Gries{20] and developed by Allen{2] and Lowry and 
Medlock{ 35] have been applied to micro-code optimization. As 
in software optimization, the microprogram is first divided 
into regions and program nodes within regions; then, 


optimization techniques are applied to each program node. 
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2.5.1 Regional Representation of Microprogram 


a 


In a broad perspective, for global optimization the 
microprogram is represented as a directed graph whose 
regions are operated upon to eliminate nonessential 
operations, optimize branch points, and maximize the 
exploitation of parallel hardware units [7,27,45,52]. The 
microprogram is represented as a linear list, R(i), of 
regions, R(1),R(2),-..,R(n), with each region made up of 
program nodes. A program node is a linear sequence of micro- 
operations which has one entry point (first micro-operation) 
and one exit point (last micro-operation). Branches (e.g., 


GO TO), if any, appear as the last actions in a node. 


Nodes are connected by arcs. These arcs represent a 
branch and are used to identify successor and predecessor 
nodes as well as articulation nodes. An articulation node is 
any node in a strongly connected region which is assured of 
being executed, if the region is entered. Thus, an 
articulation node lies on every entry and exit path of a 
region (see Figure 2.3). Regions are constructed with the 


following properties. 


1. R(i) is strongly connected: A strongly connected 
region is a set of nodes in which there is a flow path of 


arcs from any node in the set to any other node in the set. 
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Se Ee (iceeR LAs 
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R (i) 


Or H(i) Sh FR 3) 


i.e., R(i) is nested in R(j). 
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Figure 2:3 Regional Representation of Program Nodes: 
(a) Program Directed Graph; (b) Region 
Breakdown. 


20 


Optimization of regions is performed on the following 


basis: 


1. When R(i) is being optimized R(1),R(2),...,R(i-1) 


have already been optimized. 


2. When R(i) covers other regions, only nodes not 
contained in those regions must be examined. The set, R' (i), 
of unexamined nodes in R(i) is determined by 

i q 
pa Tae sf i 
BCBG) a. Blt 
i= 


L | 
2.5.2 Optimization Techniques 


Using a model defined by Cook and Fiynn{ 11], Klier and 
Ramamoorthy[27] investigated the use of the following 


optimization techniques. 
1. Identification and deletion of nonessential actions. 
2. Static frequency analysis of busy variables. 


3. Application of Tumasulo's algorithm for machine 


data-flow fan-out[ 52]. 


4. Optimization of branch-points by code rearrangement. 
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Nonessential operations (actions) which may be deleted 
are defined by Klier and Ramamoorthy as follows: 

“Redundant actions are those whose outputs are 

predictable and currently available from previous 

identical action executions. ... 

Negated actions are those activities whose results 

are never used." 


Conditions for the deletion of a successor action as 


redundant are given as follows. (from [ 27]) 


1. Two actions have the same inputs, operation, and 


output specification. 


2. No input sub-unit is redefined between the two 


identical actions. 
3. The output sub-unit is preserved. 


4. If one of the two actions is executed, then the 


other action is also executed. 


These conditions substantiate Theorem 13 in [50] by 
outlining the rules by which a redundancy algorithm may be 
constructed. It is also interesting to note that the 
register instructions defined in [50] may be mapped onto the 
micro-instruction format presented in [27]; i.e., software 
loop register operations may be expressed in a microprogram 


notation. 


Also in [27], the condition given for the deletion of an 


action as negated is that the output from an action appears 
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aS an output in a succeeding action, without first appearing 


aS an input. 


The important limitations in the above conditions are 
that only identical actions are considered, and equivalent 
sub-units are not identified; i.e., Only formal identities 
are considered. Therefore, alternate forms of an action are 


not considered. 


A review of techniques for the identification and 
exploitation of parallel operations is presented in [ 45}. 
Tumasulo's algorithm [52] is referenced by Klier and 
Ramamoorthy [27] as a means of implementing the deletion of 
actions which have the same output value. Developed for the 
IBM/360 model 91, the algorithm makes use of an operation 
stack and reservation stations to optimally queue variables 


which are to be fanned-out to parallel hardware units. 


Software compiler optimization and microprogram 
optimization have dealt with only identical actions and 
identical input and output memory sub-units, in the 
determination of nonessential actions; i.e., only the form 
of the action given at the source-level is considered. 
Chapter 3 describes the framework for the study, whose major 
objective is the development of optimization strategies 
which include an analysis of alternate and equivalent action 


forms. 
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CHAPTER III 


ANALYSIS FRAMEWORK AND PROBLEM DESCRIPTION 


3.1 Introduction 


The purpose of this Chapter is to describe the general 
framework for the problem analysis. Major sections include 
the following: The proposal of a machine description and 
microprogram format; a discussion of the problem analysis 
framework; and, the presentation of a graphical description 


of the problems addressed in this study. 


The machine description and microprogram format are 
proposed with the objective of including features found in 
many current machines and microprogramming languages. The 
Microprogram format is presented as a notation for the 


general description of a microprogram. 


The general framework and limitations of the study are 
presented in the section dealing with its scope. First, the 
problems dealt with in the study are described; second, the 
range of optimization is defined; third, the information 
collected for the problem analysis is discussed; fourth, 
nonessential actions are formally defined; and fifth, 
limitations of the analysis are presented. The last major 
section in the Chapter is a general discussion of a graph- 


theoretic approach to the problems addressed in the study. 
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—————- a ew eee ss 


Definitions which are used throughout this thesis 
(global) are preceded by the symbol sdef or def. sdef is the 
definition of a symbol which is presented to minimize 
verbosity and has no intrinsic relationship to conceptual 
development. def is a definition required to formulate a 


concept, theorem, proof, etc. 


In order to succinctly describe the probiems and 
solutions given in this thesis, it was necessary to develop 
additional notation. Wherever possible, notation is 
consistent with the existing literature. However, because of 
the nature of the problems addressed in the study, 


extensions to existing notation were reguired. 
3.2 Machine Description and Microprogram Format 


This section presents a general host machine description 
along with a description of a microprogram format. The 
microprogram format is closely aligned with the format 
described in [27] and is intended as a tool for the 
representation of a wide variety of microprogramming 


languages. 


To avoid a machine-dependent presentation, the host 
machine is described in general terms so as to include the 


aggregate features of many present-day computers. The host 
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is the machine which contains those functional hardware 
elements necessary to decode and directly execute each 
micro-operation used by the microprogram. A micro-operation 
is an operation which invokes a primitive hardware function 
(e.g., ADD, GATE, etc.), and requires a Specific guantum of 


time to be executed. 


The host machine is assumed to have the following 


hardware elements. 


1. Operational units: Units which are invoked during 
the execution of primitive operations. Examples of 
operational units are arithmetic units, selector gates, and 


Boolean logic units. 


2- Control Storage: The memory device which holds the 
microprogram. This device may be READ-ONLY or WRITE- 


controllable storage { 24]. 


3. Main Memory: The low-speed memory device from which 
machine-language instructions are fetched to be decoded, and 
ultimately executed by microprograms. The main memory may 
also contain data which is referenced by the machine- 


language program. 


4. Scratch-pad Memory: This is considered as high-speed 
memory (operating at approximately internal clock speed). It 
is used as a local store for an internal register file; 


intermediate result store ZOE microprograms; system 


- } - —! y) 


ek a | Tata 
| 


7 ite 7 a 

ee adores ef Bhs Gs aees bi: 
. abroad 

wridaniqeraanre . sm pa Fess ad? 78 nas a 


days erie oo s ; ‘ seiiiade » mann Sle@-e 


Tm. 
1) @nthe® pw — > went | pder ae pixie ree cl 
. _ - pa: 


tia diinte red * agi 
_ - 


. i 
et) Ve ae »! soos : sirnihed: veod “te 


,ae 

paiine ebrocel ahe aviae, coiml let4m, fairs aaa" 

= Sins « ae Lae G4¥ LA f 2 Co i iroope © 

ods \OuGP nama’ {P hau 7OG4? >* sunt Lenuat y io : 
+. eee otquibaiatoete 

" ; 
oF site Mp ies pare | joie, iuziowy 7. 
-—~i?e ‘Ww ‘eam I— cae i y » ags cp4ee er? 


7 7 a 
“LNG? Gewese! des 44 oo 


= 


9th) G@St Stat qalee | “stl ol. '9iieeR- Alee ~ ‘ 
Ath «RN ad “ALBA RBE SS: <8 ee A ie foe 


— 7 
tne qaeeme W654 oe ten Shee Cha leaies ae Vewtnadiae 


<a omit el enone - re Meare end aocdacs aa - : 
sBOTUP V4 MORE 


i ef ded Hate ai 10998 ON 
pa niio. rae ae Awe Me eaian zoey 


le® Gesod ee 
pee 


26 


accounting storage; or, for any other application which 


requires fast storage [ 17 j}- 


5. Data Paths: These are the paths which connect the 
memory units to the operational units. The network of data 
paths together with the operational units and memory units 


succinctly describe the host machine. 


The host machine is further qualified by the following: 

NP Number of primitive operations. 

NS Number of memory cells (also called memory sub- 
units) which may be used as input and/or output to 
primitive operation i, 1<i<NP. Note: Generally, 
these memory cells reside in high-speed memory. 

NO Number of operational units. 

T A two-dimensional array which represents the 
internal clock time, T(i,j), to execute primitive 
aperation’.i,;* *using® (operational Gtnit. Jj, 1) 1<1Sue; 
1<45<NO. 

P A two-dimensional array which represents’ the 
possible participation, P(i,k), of memory sub-unit k 
as the input and/or output to primitive operation i, 
A45KSNS, 1<i<NP. P(i,k) = 1 if participation may 
occur, 0 otherwise. 

These qualifications will aid in the discussion of the 


microprogram format presented in the next section. 
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3.2.2 Microprogram Format 


A notation for the general representation of 
microprograms is presented below. For the purpose of this 
study, it is assumed that the compiler and/or assembler has 
generated the microprogram in the form described below. No 
attempt is made here to describe this translation process; 
it is, however, asserted that such a translation is feasible 


in terms of current technology. 


To avoid a restrictive format, the microprogram is 
described to include highly encoded (vertical) and direct 
(horizontal) microprogram formats. It is noted that the 
difference between the two microprogram types iS on a 
continuum scale; and, such a universal definition likely 
lacks the direct inclusion of technical characteristics 
which are unique to a specific language and machine 
configuration (e€.g., emit fields, masks, etc.). This does 


not detract from the generality of results, as provisions 


are made for such technical qualifications of micro- 
instructions. 
The following definitions are given before the 


discussion of the microprogram format is presented. 


def. 3.1 Source Unit: The memory sub-unit used as input to 


a micro-operation. 
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def. 3.2 Sink Unit: The memory sub-unit to which the 


result of a micro-operation is gated. 


def. 3.3 Microstep: Time required to execute one micro- 


instruction. 


def. 3.4 Micro-instruction: The unit of a microprogran 
Which contains the micro-operation(s) to be executed in one 


microstep. 


def. 3.5 Micro-operation: The unit of a micro-instruction 
which causes a primitive hardware function to be invoked. 
Generally, a micro-operation specifies operational units, 
source unit(s), and sink unit(s). The micro-operation is 
functionaliy defined by the following operational classes. 

a) Data-flow actions: Those micro-operations which 
cause a transfer of data within the machine. 

b) Content permutation actions: Those micro-operations 
which define or change the state of memory sub- 
units. 

c) Directional control actions: Those micro-operations 
which control the selection of the next micro- 


instruction (and micro-operation) to be executed. 


It is important to note that the terms micro-operation and 
action are synonymous; however, the latter is used where 


functional qualification serves to clarify a point. 
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The definitions and qualifications given previously have 
established a basis upon which a general microprogram 
notation may be developed. The notation presented below is 
Similar to that described by Klier and Ramamoorthy [27]. The 
{c}, and {TV} qualifications have been added to the notation 
given in [27], in order to include tining constraints and 
control information. Also, because this study deals with 
alternate memory sub-unit assignments, it will be necessary 


to reference the memory sub-units by subscript notation. 


Given that there are H micro-instructions and that there 
may be L micro-operations defined by each micro-instruction, 
the following notation is presented to describe a 
microprogram. 
apo Upepese, H 


. 
{I(p,q) = Op {sc} {sk} ic} {OU} {TV} 
L Sq=1,2,---,¢L 


where 
I An index which uniguely identifies the micro- 
operation within the micro-instruction. 
Op One of the primitive operations. 
{sc} The memory sub-unit(s) used as the source for Op. 
{sk} The memory sub-unit(s) used as the sink for Op. 
{Cc} A control field which contains information such as 


control bits, shift amounts, selection masks, etc. 


{OU} The operational unit(s) required to execute Op. 
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{TV} Time validity, T(i,j), in terms of internal clock 
cycles, of executing Op(i) using {sc}, {sk}, and 


{OU}, 1<i<NP, 1<5<NO. 


AS an example, consider a micro-instruction encoded as 
described by Tucker and Flynn[ 51}: 

BS a -R5tR6 oR s=<27R7, GOrto, P2. 
The micro-instruction is divided into three micro-operation 
fields: ADD, SHIFT, and SEQUENCE. It is coded to add R6 and 
R5 and then gate the result to R3, shift R7 left two bits, 
and select P2 as the next micro-instruction address (/R7 
denotes contents of R7 are to be shifted). Assuming the 
micro-instruction was the second one in a program, its 
expression in the notation given would be as follows (assume 
address of P2 is held in R4). The {0U} and {TV} fields are 


not shown. 


Index Op sc sk c 
Tag2ie tps ADD {536} {3} 
Ei2e2)i: SHIFT {7} {7} {-2} 
T (273): JUMP {4} 


The microprogram format has been presented as a4 means 
for the general expression of a wide range of 
microprogramming languages. It also provides a basis upon 


which the problem analysis framework may be described. 
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roblem Analysis Framework 


The purpose of this section is to describe the framework 
of the study in terms of the following: The problems 
addressed in the study; the set of actions considered for 
optimization; the information collected from the set of 
actions; the actions which are considered for deletion; and, 


the limitations of the analysis. 


Simply stated, the problem undertaken in this study is 


fo 


the deletion of nonessential actions from a set of 
sequentially executed micro-operations. Previous attempts at 
the deletion of nonessential actions have relied upon an 
evaluation of only the source form of each action; however, 
the deletion of all nonessential actions requires the 
identification and evaluation of all possible forms of each 


action. For example, consider the following example of four 


sequentially executed, highly encoded actions. 


Eas ADD {334} {5} 
TQ): GATE {53 {6} 
I (3): GATE {3} iy 
(hy): ADD {734} {5} 


In this example, an examination of only the source-level 
form of each action yields no nonessential actions. An 
evaluation of the equivalent memory sub-units, {3} and Thy 


yields another form for i(4), ADD {3; 4} (Sie iS 
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alternate form for I(4) is redundant with respect to I1(1) 


and is, therefore, nonessential. 


The significance of the problem is based upon the 
premise that nonessential actions are most likely to occur 
in action forms which are alternate and equivalent to source 
forms. Nonessential action deletion is directly applicable 
to both Microprogram Optimization and General Compiler 
Optimization; therefore, the results of this study may be 


Shared by both disciplines. 


Optimization strategies are to be developed Which 
include an analysis of all possible action forms. In order 
to achieve this goal, the following three major problems are 


addressed in this study. 


1. All possible forms for each action must be 
determined. The variables of form for an action are input 
and output memory sub-units. Therefore, the problem is to 
determine all memory sub-unitsS which may participate as 


inputs and/or outputs for each action. 


2. Given that alternate action forms are determined, 
conditions for the deletion of an action must be defined in 
terms of alternate action form assignments. The problem is 
to define a minimal set of conditions which must be met in 
order for an action to be a negated, redundant, or parallel 


candidate for deletion. 
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3. Given a set of deletion candidates, all possible 
deletion strategies must be identified for each deletable 
action. The problem is to identify the strategies available 
for the deletion of an action; and, to Specify conflicting 
strategies. With all strategies identified and conflicting 
strategies determined, the final problien zs the 


determination of optimal deletion criteria. 


In a broad sense, the solution to the above problems 
requires the following: The identification of alternate 
action forms; and, the development of strategies based upon 
deletion advantages gained through alternate action form 
assignments. A general statement of the approaches used in 


the solution of the problems described is presented next. 


3.3.1.1 General Statement of Solution Approach 


With N sequentially executed actions, i equivalent 
actions (i<N), and j equivalently defined output units 
(j<i), examination is given to determine all possible forms, 
F(m), of action(n), m=1,2,...;N. Equivalent actions and 
eguivalent units are used to identify all possible source 
and sink unit combinations for each action; i.e., variables 


of form for actions are source and sink units. 


An analysis of F(m) is performed to identify ali actions 
which may be deleted. A minimal set of conditions is defined 
for negated, redundant, and parallel actions. Those actions 


which conform to at least one set of deletion conditions 
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make up the set of actions, D, which are candidates for 


deletion. 


The strategies which may be employed in the deletion of 
each action in D are defined. Also, the conflict of 
strategies is defined, since the strategies employed in the 
deletion of one action may affect the strategies available 
forstheideletion» of sother: actions... With, strategies and 
strategy conflicts defined, it is possible to select one 
form, Z(k), for each action, Which will result in the 
optimal deletion of actions from the original set of N 


actronss atx) © Fi(m) . k=l 2y0c0 eg Ns 


The following section describes the range of actions 
over which the optimization strategies are to be applied; 


i.e., the set of N actions to be examined is described. 


Analysis of nonessential actions is performed only upon 
nodes in properly nested (or simply nested) regions of the 
microprogram [2,7]. The microprogram notation described is 
used in all program node examples. The definitions given 
below will aid in the discussion and analysis of program 
node examples. Throughout the remainder of this thesis, the 
term "unit" will generally be used in place of the more 


verbose "memory sub-unit". 


sdef. 3.1 WN: Number of actions in a program node. 
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sdef. 3.2 NS: Number of memory sub-units. 


Sdef. 3.3 nA: Number of elements in A, where A may be any 


defined variable. 
sdef. 3.4 MO(i): The ith action in a program node, 1<i<N. 


sdef. 3.5 OP(i), sk(i), and c(i): The primitive operation, 
Sink unit, and control field respectively, specified in 


MO (i). 


def. 3.6 sc(i,j,k): Reference source units within MO(i) 
and viay have one’ (1), two ‘G,4),. or three (7a) 
subscripts. 

sc(i) references all possible source unit combinations 
(forms) for MO(i); e.g., sc(4) references ali possible 
source unit combinations for MO (4). 

sc(i,j) references the jth combination of source units for 
MO (1), ‘1S]jSmsc (i); e.g., sc(2,3) references the 3rd source 
unit combination for MO(2). Note: j=1 always references the 
source unit combination given in the source form of the 
action. 

sc(i,j,k) references the kth source unit in the jth source 
unit combination for MO(i), teksnse (4779) 7 (es0%, ee (42,1) 
references the 11st source unit in the 3rd source unit 


combination for MO(4). 


The program node example given in Figure 3.1 is used as 
a basic example in the explanation of algorithms and 


analysis technigues. To reduce complexity, action examples 
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do not contain the {00} and {TV} fields given in the 
microprogram notation. It is assumed that before code 
changes are made, precautions are taken to insure against 


the creation of timing hazards. 


To further reduce complexity, action examples will 
contain a maximum of two source units and one sink unit; 
also, nodes will contain only highly encoded micro- 
instructions. These simplifications are not restrictive, as 
the theoretical framework of the thesis is based upon any 


encoding structure and arbitrary source unit bounds. 


PO: GATE {2} {3} 
T(2): GATE {4} {6} 
£(3):: ADD {336} {7} 
I(4): ADD {432} {2} 
E(5)i2 ADD {234} {8} 
1(6).: GATE {7} {5} 
Tere: ADD (2; 6} {9} 
I (8): SUB {7 33} { 3} 


Figure 3.1 Program Node Example 


It is useful to represent a program node in terms of the 
unit assignments to each action. This is done by the 
creation of a unit assignment table (UA) for each program 


node. The structure of the unit assignment table is given in 


the following section. 
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Given that there are NS memory sub-units (e.g., 
registers) and N actions represented in the node, the unit 
assignment tablet) witi) bel an’ (N#1)) thy (NS farray. The 
additional row, UA(N+1,j), is used to represent those units 
which are busy on exit from the node [35]; i.e., those units 
used as inputs in successor nodes, before they are 
EGdet ined. UA(NtH,3) (= 1) if unit? {J} | ts) busy on -exit,; 2 
otherwise. UA(i,j) = 01114213 respectively, if unit {Jj} is 
not used, used aS a source unit, used aS a sink unit, or 
wWsed t4a5) 4a “source ftand’ Sink’ unit?for)}MO(1)) 1=1,2,..+-;,N- 


(vertical bar, |, used throughout as an OR symbol) 


For example, if units {3} and {8} are busy on exit from 
the program node given in Figure 3.1, then the unit 
assignment table generated for the program node is aS given 


in Table 3.1. For Table 3.1, N = 8 and NS = 9. 
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Table 3.1 Unit Assignment Table for Program Node Given 
in Figure 3.1. 


Memory Sub-units 


Index 1 y S 4 5 6 cs 8 9 
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The unit assignment information, along with eguivalent 
action and equivalent unit information, is used in tae 
determination of nonessential actions. The following section 


presents definitions GE equivalence relations to be 


collected from the program node. 


As ‘stated earlier, a major problem addressed in the 
study is the determination of alternate action forms. The 
variables of form for actions are source units and sink 
units. In order to determine all possible combinations of 
source units for each action, it is necessary to identify 
eguivalent units. In order to determine all possible sink 


weiee. for “each action, it 1s necessary to identify 
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equivalent actions. Simply stated, the information to be 
collected from the program node is that information required 
to identify alternate source and sink units for each action. 
The information to be collected is presented in the 


following definitions. 


sdef. 3.7 QMSU: Memory sub-unit qualified by its index of 
definition. A QMSU is an ordered pair (u,uid), where u is 
the identifier of the unit, and uid is the index of the 
action which defines u. For example, (3,6) represents unit 
{3} as defined by the 6th action in the program node. If the 
unit is busy on entry into a program node, its index of 


derinztetron \is#zero. 


The qualification of units is an important feature of 
this study. It allows sequential analysis of actions to be 
performed without the traditional program optimization 


problems created by redefinition. 


def. 3.6 Equivalent units: Those QMSUs whose contents are 
identical; i.e., where 1S5i,m<NS and 1<j,k<N, 
(i,j) <=> (m,k) 


if MO(j) <=> MO(k). 
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def. 3.7 Equivalent actions: Those actions whose outputs 
are identical with respect to value; i.e., where 1<5i,j,k<N, 
MO(i) <=> MO(}) 
if one of the following is true: 

Opity = Opti) Gi) = c(j), and sc(i,1) <=> sc{j,1); 


Gen elOt ae ke 


Op(k) = data-flow, sk(k,1) = i, and e(k). =og: 
Gr LOL. ji. Ky, 
Op(giie= cata-tlow, Ssk(jeat) = Kk, and c(4). =.¢. 


it is important to note that equivalent units and equivalent 


actions conform to transitive laws. 


def. 3.8 Alternate source units: Those QMSUsS which may be 
defined before MO(i) and are equivalent to a QMSU specified 
in the source form of MO(i); i.e., where 1<m,t<NS and 
TEKS SW y 

(m,k) is an alternate source unit for (t,s) at MO(i) 


iff k < i and MO(k) <=> MO(s). 


def. 3.9 Alternate sink units: Those QMSUs which are 
equivalently defined; i.e., where 1<m,t<NS and 1<k,s<N, 
(m,k) and (t,s) are alternate sink units for each other 


iff MO(k) <=> MO(s). 


With the unit assignment table described and the 
equivalence relations defined, it is now possible to 
formally define all nonessential actions considered in this 
study. The definitions given in the next section qualify 


nonessential actions to include general deletion strategies. 
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Nonessential actions are defined in terms of the unit 
assignment table and equivalence relations. As given in 
Chapter 1, nonessential actions include three classes: 
Negated, redundant, and parallel. This section further 
gualifies the classes aS parallel-forward, parallel- 
backward, redundant-forward, redundant-backward, and 
negated-forward. This qualification is required in order to 
represent all possible deletion strategies; e.g., if MO{i) 
and MO({j) are parallel actions, MO(i) <=> MO({j), then sk(i) 


may be moved to MO(j), or sSk(j) may be moved to MO(i). 


def. 3.109 Paralliel-forward action:»' HC (4) is parallel- 
forward with respect to MO(j), 1<51,j<N, (and may be deleted) 
iff MO{i) <=> MO({j5); 

2eG O73 

Sead 
y UA (m,Sk({i)) = 0; 

m=i+1 

UA(j,Sk(i)) = 012; 

Either sk(i) may replace sk{j) at MO(j) 


or sk(i) may be fanned-out at MO(j). 
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def. 3.11 Parallel-backward action: NOG) Sis @paraiie:— 
backward with respect to MO(i), 1<i,jSN, (and may be 
deleted) 
iff MO(i) <=> MO(4); 
aie 
where sk(i) # sk(j) 
J 
» UA(m,Sk(J)) = 23 
m=it1 
where sk(i) = sk(j) 
OP B=Ut W142 ,%---:, 7 | 
UA(m,sk(j)) = 1; 
Either sk(j) may replace sk(i) at MO(i) 


or sk{(j) may be fanned-out at MO(i). 


def. 3.12 Redundant-forward action: MO(i) is redundant- 
forward with respect to MO(j), 1S5i,jSN, (and may be deleted) 
Get MGi@) <=> HO); 
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def. 3.13 Redundant-backward action: MO(j) ais redundant- 
backward with respect to MO{i), 1S$i,j<N, (and may be 
deleted) 
LEE BO (i je<=> MOG); 

Tee ie 

Sk(i)- = sk (4); 

POmem alt eras aig =i 


UA(m,sk(j)) < 1. 


Theorem 32.70°An ‘evaluation of: all «parallel sactions is 


sufficient to identify all redundant actions. 


PLL: 

by smnsexatinataon. of deft 3.10, 3.142, 3. 11, and 3.13, it can 
be seen that the conditions for forward and backward 
redundancy are included in the conditions for forward and 
backward parallelism. If MO(i) is redundant with respect to 
MO (5), then sk(i) = sk(j) and replacement is trivial; 
therefore, the conditions for parallelisn, with 
Sk (iy =esktjy; are identical to the conditions for 
redundancy. Hence, the set of all redundant actions in a 
program node is covered by the set of all parallel actions. 


Q.E.D. 
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def. 3.14 Negated-forward action: MO(i) is negated-forward 
with respect to MO(j), 1S5i,j<N, (and may be deleted) 
Her ok(a) = sk Gy; 
Uae 
i) 
PY UA (mn psk (a) )) =.2- 
m=i+1 
From Theorem 3.1, it can be seen that an evaluation of 
parallel-forward, parallel-backward, and negated-forward 
actions is sufficient to identify all nonessential actions. 
Previous research [27] has presented a separate analysis of 
parallel and redundant actions. Theorem 3.1 provides a more 
general basis of nonessential action analysis; since, 
nonessential actions may now be redefined as negated and 


parallel. 


The general approach used in this study may now be 
stated in terms of these nonessential actions: It is to 
identify those alternate unit assignments which may be made 
in order to create a parallei-forward, parallel-backward, or 
negated-forward condition in the unit assignment table. The 
following section describes the specific limitations imposed 


upon the development of the approach. 
3.3.5 Limitations of Analysis 


The following are the limitations in the analysis of 


nonessential actions, as proposed in this study. 
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1. Analysis is performed only upon nodes; a. Ga, 


regional analysis is not included in the study. 


2. No run-time statistics are used in the 
identification of equivalence relations. For example, under 
the analysis proposed in this study, QMSUs (3,1) and (2,3) 


are not identified as equivalent in the following program 


node. 
I(1): GATE {6} {3} 
Na ADD {3, 7} {4} 
I (3): SUB rg ae & 2} (assume {4} - {7}) 


Of course, if run-time statistics were collected, QMSUs 


(3,1) and (2,3) would be identified as equivalent. 


a= Directional control actions are not considered as 


Candidates for deletion. 


The framework and limitations of the study have been 
discussed, and the major problems addressed in the study 
have been described. The next section presents a graph- 
theoretic description of the major problems. It is presented 
Oe provides ay concises description of «the optimization 
objective function and the combinatoric nature of the 


problems undertaken in this study. 


The objective function is satisfied, if all nonessential 
unit definitions are deleted from the program node. Ciearly, 


the problem involves combinatorics: To find the combination 
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of units and primitive Operations which will negate the 
largest number of micro-operations. The solution involves 
the identification and evaluation of all possible source and 
sink units. The problen may be expressed by the 
representation of the program node as a directed graph 
(Similar to a dependency graph [2]}). Sink units are 
connected to source units by operational arcs, with each 
action represented as a level in the tree. Unmodified 
(c(i) = @) data-flow actions are represented by vertical 
operational arcs. For example, the following program node is 


represented by the directed graph in Figure 3.2. 


E(1): GATE {2} {3} 
uae 9 GATE {4} {6} 
Ey: ADD {3776} {7} 
I(4): GATE {7} {5} 


For the analysis of the above program node, assume that 
units {2} and {4} are defined on entry into the program 
node, and that the 1(4) definition of unit {5} is used in 


Successor nodes. Examination of the code shows that the 
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Figure 3.2 Directed Graph for Simple Program Node 


possible source units (represented as QMSUs) for I(3) are 
Pages eae soot Ot, (27036,2} Aepand” {2,0;4,0)} 3; and, the 
possible sink units for 1(3) are {7,3} and {5,3}. With four 
possible source unit combinations and two possible Sink unit 
combinations, 1({3) has eight possible combinations. If an 
exhaustive tree searching approach is used, then after seven 


regenerations, the following optimal tree is selected. 
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While a graph-theoretic approach provides an effective 
way to represent the program node, it provides no advantage 
in the determination of QMSUS which may participate as 
alternate source and sink units. Although the optimization 
of the above program node is given in two-dimensional space, 
the analysis of the general problem involves three- 
dimensional space. In the above example, alternate units are 
identified by an analysis of vertical operational arcs (GATE 
Operations), and each micro-operation is identified by a 
tree level; however, the general problem reguires the 
identification of equivalent QMSUs, and the identification 
of equivalent operations. For example, if the above program 


node is coded with an equivalent action at I(4), 


I (1): GATE {2} {3} 
m2): GATE {4} {6} 
ES): ADD {3; 6} {7} 
T(4y : ADD {2;4} {8} 
15): GATE {7} {5} 


then the graphical representation would be as given in 


Pique 323. 


Equivalent actions, I1(3) and I(4), must be represented 
in the tree so that all alternate source and sink units may 
be identified. In order to represent equivalent actions, in 
two-dimensional space, it is necessary to perform the 


following functions: 
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Figure 3.3 Directed Graph of Program Node With 
Equivalent Actions 


1. With each action examined, the sub-graph which 
emanates from the source unit nodes is examined in order to 


identify equivalent predecessor actions. 


2. A set of forward pointers is defined for the 
predecessor actions which are equivalent to the current 
action, and a set of backward pointers is defined for 


eguivalent successor actions. 


3. The current action's sink unit is identified as an 


alternate sink unit for the equivalent predecessor actions. 
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Whiie the above functions are not impossible to perfora, 
the graphical representation of all QMSUS which may 
participate as alternate source and sink units involves 
needless recursive analysis of predecessor and successor 
sub-graphs. Furthermore, there is ho reduction of 
information when MO(i), i=1,2,...,N, are converted to their 
gtaphical representation, since Op(i) and c(i) must be 


maintained for the analysis of equivalent operations. 


Although a graph-theoretic approach is feasible, the 
identification and representation of equivalent QMSUs and 
actions present needless computational complexity. In the 
next Chapter, it is Shown that all alternate source and sink 
units can be identified, with a maximum of two examinations 
of each micro-operation in the program node. For an analysis 
of other approaches to similar combinatorics problems see 


[ 39}. 


The major objective of this Chapter has been to describe 
the framework for the study. The problems dealt with in the 
study were defined; the limitations and means of analysis 
were presented; and, the objectives of the study were 
discussed. In the next Chapter, a major problem of the study 
is undertaken: The identification of all possible forms of 


each micro-operation. 
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CHAPTER IV 


IDENTIFICATION OF ALTERNATE ACTION FORMS 


4.1 Introduction 


Se a ss = = 


The objective of this Chapter is to present and _ prove 
the correctness of an algorithm which identifies all 
possible forms of each action in a program node. As_ stated 
in Chapter 3, the variables of form are source and sink 
units. In Section 4.2, alternate source and sink units are 
defined as to type. A general description of the alternate 
unit identification algorithm is presented in Section 4.3 
along with detailed descriptions of each pass in the 
algorithm. A proof of the correctness of each pass is 
offered in Section 4.4, and then a brief discussion of 


parallel actions is presented in Section 4.5. 
4.2 Alternate Unit Types 


It was found that there are three types of aiternate 
source units: Those units which are not redefined (Current 
value) before the appearance of the action to which they are 
alternates (Type C); those units which are Redefined before 
the appearance of the action to which they are alternates 
(Type R); and, those units which are defined by the 


assignment of Alternate sink units (Type A)- 


The approach used to identify alternate sink units is to 


represent, for each action, those actions to which the sink 
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unit definition may be assigned. From def 3.9, it can be 
seen that if sk(i) may be assigned to MO(j), then sk(j) may 
be assigned to MO(i); therefore, the representation of all 
actions to which sk(i) may be assigned is sufficient to 
identify alipailternate sink units for MO(i), i=1,2,...,N- 
There are two types of alternate sink unit references: Those 
which represent Predecessor actions to which a sink unit may 
be assigned (Type P); and, those which represent Successor 


actions to which a sink unit may be assigned (Type S). 


The conditions which define each unit type are given 
below; also, the information required to uniquely identify 
each unit 1s presented. It should be noted that the 
conditions described below define possible alternate units; 


the conditions of feasibility are defined in Chapter 5. 


4.2.1 Alternate Source Unit Types 


def. 4.1 Type C alternate source unit: (m,k) is a Type C 
alternate source wnat “<fLor (clopspPe kat > BOE 7e isn Pesis7y 
iSESSZESN;, 
a le a Cte See Fee 

MO(k) <=> MO(s) ; 

Por jakt kts pn 7 tl 


On. (45s 1 


Unique representation of (m,k) as a Type C alternate source 


unit is written as: (C,m,k). 
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def. 4.2 Type R alternate source unit: (m,k) is a Type R 
alternate source unit for (t,s) at MO(i), 1<m,t<Nns, 
{Ses -LSN, 
‘Pte 1X 13 

MO(k) <=> MO(s); 

POLFat eed st one, jaekt th Al 3c se 7 


UA (t7n). > 1- 


Unique representation of (m,k) as a Type R alternate source 


Unite) CR i, Kk): . 


def. 4.3 Type A alternate source unit: (m,k) is a Type A 
alternate source unite “for .{tes), «ac. NOG) 7. tm,23Na, 
Tk, Sel =N,; 
Peta ke yous 

Sk(x) = nm; 

MO(x) <=> MO(k); 


MO(k) <=> MO(s). 


Unigue representation of (m,k) as a Type A alternate source 


unit at MO(i) (assume MO(x) defines m): (A,m,k,x). 


Alternate sink units nay be identified by the 
representation of equivalent actions. The next section 
describes the two types of alternate sink unit references: 
Eguivalent successor action references and equivalent 


predecessor action references. 
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def. 4.4 Type P alternate sink unit reference: MO(i) is a 
Type P alternate sink unit reference at MO(j), 1<i,j<N, 
PEER i acs}: 

FOG>S.02 


MO(i) <=> MO(4). 


Unique representation of MO(i) as a Type P alternate sink 


unit reference at MO(j): (P,i). 


def. 4.5 Type S alternate sink unit reference: MO({i) is a 
Type S alternate sink unit reference at MO(j), 1Si,j<N, 
PEL 21s 

j iO; 


MO(i) <=> MO(j4). 


Unigue representation of MO{i) as a Type S alternate sink 


unit reference at MO(j): (S,i). 


Alternate source and sink units have been defined as to 
type. The next major section describes the organization and 


representation of alternate unit information. 


The organization of the alternate unit information is in 
tabular form. The alternate unit assignment table (ALT) is 
used to represent prime and alternate source and sink units. 
The alternate unit identification algorithm, described in 


Section 4.4, constructs the ALT. 
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4.3.1 Alternate Unit Assignment Table (ALT) 


The ALT is designed to represent all possible forms of 
each action by the specification of all alternate source and 
Sink unit assignments. It is generally structured to contain 
the following: Unit references which may participate as 
alternate source units; and, action references to which sink 
units may be alternately assigned. A precise statement of 


the structure and referencing of the ALT is given next. 


With N actions, p source units per action (maximum), and 
one sink unit per action, the ALT is an WN by (pt1) array. 
Each element in the array is divided into two parts: The 
prime unit specification and the alternate unit 


specification. 


The prime source unit specification is an ordered pair, 
QMSU, which represents the source unit specified in the 
source form of the action. The prime sink unit specification 
fstian *tordered | pair, (u uid)ip (wheretulistthe; identif£ierroft 
the sink unit specified in the source form of the action, 
and uid is the index of the action. The prime unit fields 
for both the source and sink units are ordered pairs 
referenced in the ALT as scalars: ALT (i,psc(x)) references 
the rth prime source unit for MO(i), 1<rsnsc (1,1) ; 
ALT(i,psk) references the prime sink unit for MO(i); and, 


ALT(i,psc) references all prime source units for MO(i). 
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The alternate source unit specification is an ordered 
triple if the unit is a Type C or Type R alternate, and an 
ordered quadruple if it is a Type A alternate. The order of 
values is as defined in Section 4.2.1. The alternate source 
unit field in the ALT is referenced as a vector of entries 
where each entry is an ordered set: ALT (2-asctr yd 7k) ) 
references the kth value in the jth ordered set which is an 
alternate, for prime Sources unit ‘ALT (i,psc (£)) ,.1s<rsnsc(i, 1); 
ISjsnALT(i,asc(r)), 1Sks4. ALT(i,asc(r,j)) references the 
jth ordered set; ALT(i,asc(r)) references all ordered sets 
Witch ware j\alternate »for (ALT GQppsc(z)); aad, ALT (izasc) 
references all ordered setsS Which are alternate source units 


in MO(i). 


The alternate sink unit specification is an ordered pair 
whose values are as defined in Section 4.2.2. The aiternate 
sink unit field is referenced as a vector of ordered pairs: 
ALT (i,ask(j,k)) ceferences the kth value in the jth ordered 
pair which represents an alternate sink unit assignment for 
MO{i), -1SjSnALT(i,ask(j)), 185kS2. ALT(i,ask(j)) references 
the jth ordered pair for MO(i); and, ALT(i,ask) references 
all ordered pairs which represent alternate sink unit 
assignments for M04). Co clarify the ~structure <eaad 


referencing of fields within the ALT, an example is given. 
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4.3.1.2 ALT Example 


AS an example of the ALT structure, Table 4.1 gives the 
ALT generated for the program node in Figure 3.1. To review 
the notation and aid in understanding the ALT, the 1(3) 
entry in Table 4.1 is discussed. The first prime source unit 
tor M03) is {3}, the second is {6}. Relative to MO(3), 
units {3} and {6} are last defined at MO(1) and MO(2) 
respectively. Thus, the ordered pairs, (3,1) and (6,2), are 
QMSU's which represent prime source units for MO(3). In 
terms of referencing these pairs as ALT elements, 
ALT (S7pse(l)) = @,i) and ALT (3,psc(2)) = (6,2). Similarly, 
the prime sink unit entry, (7,3), references unit {7} 


defined) at) NO(3); and, ALT (3,psk) = (7,3). 


The first alternate source unit entry, (C,2,0), means 
that unit {2} defined at MO(0) [on entry] is a Type C 
alternate for unit: {3} at MO(3) [recall, Type’C méans unit 
{2} is not redefined between MO(0) and MO(3) J. In terms of 


referencing the ALT elements, ALT(3,asc) = (C,2,0) (Cp 40)% 


ALT (3,asc(1)) = (C,2,0), and ALT (3,as¢ (2),} (C7 4,0): 


2s 


ALT (ease (2, 1) = (C,4,0); and, ALT (3,asc (2, 1;,2)) 


The. alternate sink unit entry, (S,6), means that MO(6) 
is a Type S alternate sink unit reference for MNO(3)3 and, 
ALT (3,ask(€1)) = (5,6). That LS; MO(3) and MO{(6) are 
equivalent and the unit defined at MO(3) may (possibly) be 


defined at MO(6), and vice versa. 
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To review notation developed in Chapter 3, a symbolic 
qualification of MO(3) is presented. 
MO(3) <=> MO(6) [MO(3) and MO(6) are equivalent]. 


Op(3) = ADD [operation for MO(3) ]. 


Sk (3) {i ae fs2nk Uni tetors to(3)4: 

e€(3) = 2 {no control bits for /M0(3) ]: 

sc(3) = {3;6} {334} {2 ; 6} {234} [all possible source 
unit combinations, expressed as QMSUs, for MO(3) ]. 

nsc(3) = 4 [number of possible source unit 
combinations]. 

Se(3,1) i= {336} [first (source-level) source unit 
combination for MO(3) j. 

nsc(3,1) = 2 [number of source units in first source 
unit combinations for MO(3) }. 

sc(3,4,2) = {4} [second unit in fourth source unit 
combination for MO(3) ]. 

(6,2) <=> (4,0) {unit {6} defined at MO (2) is 


equivalent to unit {4} defined at MO(0) ]. 
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Table 4.1 ALT for Program Node Given in Figure 3.1 
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The next major section describes the algorithm whose primary 


purpose is the construction of the ALT. 
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4.4 Alternate Unit Identification Algorith 


The ALT is constructed by the algorithm described below. 
It is divided into two passes. Pass 1 is responsible for the 
identification of Type C and Type R alternate source units 
and Type P and fype S alternate Sink units. Pass 2 
identifies all Type A alternate source units. Pass 1 
requires only one examination of each action in the program 
node; and, Pass 2 requires only one examination of each 
alternate sink unit entry defined in Pass 1. Actions are 


examined in sequence within the node and within the ALT. 


In order to represent ail possible combinations of 
Type C and Type R alternate source units and Type P and 
Type S alternate sink units, with only one examination of 
each action, the algorithm must provide the following 
information upon examination of MO(i), 1<5iS<N: sSk(i) must be 
"matched" with all QMSUS previously defined to contain the 
same value; and, ail possible source unit combinations of 
MO(i) which produce results identical to previous actions 
must be identified. Concisely, equivalent sink units are 
combined to identify alternate source units; equivalent 
actions are combined to identify alternate sink units. The 
next section describes the algorithm's general approach to 
the identification of equivalent units and equivalent 


actions. 
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4.4.1 General Description of Algorithm Approach 


The algorithm proposes the unique identification of 
actions by the application of a haShing function [28] to 
(Op(i),sc(i),c(i)). The hashing function may, for example, 
be a Boolean operator function or any residue modulo(n) 
arithmetic technique; also, the hash table may have only a 
prime area or it may be organized as a linked list. The 
hashing function selected is of little importance to this 
study, as it would likely be dependent upon average node 
size, encoding structure, hashing source image, and 
operation distribution factors. For the purposes of this 
study, it is assumed that each unigue hash image, 
(Op(i),sc(i,j),c({i)), when hashed, generates a unigue hash 


table address. 


4.4.1.1 Operation Hash Table {OHT) 


The algorithm constructs an operation hash table, OHT, 
and a unit hash address table, UHA. Each element in the OHT 
is a vector of ordered pairs, QMSUS, which are equivalently 
defined by the action that corresponds to the element 
address; exception: OHT (NOP, (i,j),9) contains a list of OHT 
addresses where unit {i}, as defined by MO(j), is 
represented. For example, the following code is presented 


(note that I(2) and 1I(3) are equivalent actions). 
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I(1): GATE {3} {8} 
I (2): ADD {83 2} {7} 
I(3): ADD {332} {6} 


If the respective OHT addresses are as follows, 


(GATE, (3,0) , 0) = 31 
CADDAGS, 152,.0).,0) = 16 
(ADD, ©@ 932,09) ,9) = 28 
then (8,1) € (31) 
(7,2),(6,3) € (16) 
(7,2) ,(6,3) € (28) 
and OHT(NOP,(8,1),0) = 31 
OHT(NOP, (7,2),0) = 16 28 
OHT (NOP, (6,3),0) = 16 28 


The function of the OHT is to represent those QMSU's 
Which are egquivaiently defined. In order to retrieve 
equivalent unit information from the OHT, a method is 
required for accessing the OHT by unit reference. The unit 
hash address table (UHA), described in the next section, 
provides a way in which a set of equivalent units may be 


found by accessing only one member in the set. 


4.4.1.2 Unit Hash Address Table (UHA) 


With NS memory sub-units, the unit hash address table 
(UHA) is a 2 by NS array where UHA(1,i) contains the index 
of the action which last defined unit {i}; UHA(2,1) contains 
the OHT address which corresponds to the source form of the 


action which last defined unit {i}. For example, if unit {6} 
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Po elast pedefined = ate HO(9)p ‘by (ADD, (3,0;4,2),0), and if 
(ABD, (3-034, 2) , 9) hashes to OHT address LW, then 


UHA(1,6) = 9 and UHA(2,6) = 16. 


The memory management techniques for the storage and 
retrieval of the OHT and UHA elements are not presented. 
Such techniques are dependent upon specific machine and 
Main-memory characteristics and would, therefore, vary with 


different configurations. 


A final note on the hashing function: Ordering of the 
source units, where primitive operations conform to 
commutative laws, eliminates the chance of nonmatches on 
action forms which have the same result. For example, source 
units for primitive operations such as ADD should have a 


defined order of appearance in the hashing source image. 


A detailed description of each algorithm pass is 
presented next. For the pass descriptions and proofs, the 
following gualifications are given: OHT‘'i‘' represents the 
OHT address generated by hashing (Opti) psc; 3),- eu 
OHT(i) represents the vector of ordered pairs at OHT'i'; 
OHT[i] represents the vector of ordered pairs where iis a 
pre-defined OHT address; the symbol, « , is used to 
represent concatenation; and, (x # y) vepresents all x 


except y where xn y # @ is possible. 
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The major portion of the algorithm is dedicated to 
Pass 1. As stated earlier, Pass 1 identifies all Type C and 
Type R alternate source units and ail Type P and Type $ 
alternate sink units for each action. Pass 1 is divided into 
three phases. 

Phase 1.1: Identifies all Type C and Type R alternate 
source units for each action. 

Phase 1.2: Determines alternate sink unit references for 
unmodified data-flow actions; e.g., a GATE micro-operation 
where the control field is null. 

Phase 1.3: Determines alternate sink unit references for 


actions which are not unmodified data-flow actions. 
4.4.2.1 Pass i,) Phase 1.1 (Figure 4.1) 


Phase 1.1 initializes the UHA and OHT before the 
program node is examined; identifies prime and alternate 
source units (Types C and R) for each action; determines the 
hash address of the current action with its prime source 


units; and, updates the UHA for the current action. 


Step 1. The OHT and UHA are initialized before the program 
node is examined. Each unit is identified as if it were 
defined by a trivial SHIFT action; i.e., Op(0) = SHIFT, 
ee(o, a= 4, sk) = 79, ¢(0), = 2. UHA(1, 3) = 0 tor ali j so 
that references to units defined in a predecessor node are 


distinct fron units defined in the current node. 
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GHA(2 55) = \OHT' SHIFT, (470) par: 

GUT (SHIFT, (4,0), 0) = (420); vand, 

OHT(NOP,(3,0),0) = OHT'SHIFT,(j,0),0'. It is noted that if 
the analysis is performed on a regional basis, the UHA(2,j) 
is left unchanged for those units which are busy on entry 


into a node. 


Step 2. Prime and aiternate source units are identified and 
placed in the ALT; and, each prime source unit is augmented 


With its current index of definition. 


For example, if the current action is 
qT (3) ADD {3; 4} {6} 


and the related UHA and OHT values are as follows, 


Units 
1 2 3 4 5 
[ (= a he Cay ls eras Geer ea neo | 
iL 2apeo lied [e5) p33 4 


UHA  $—-}-—-+——-}-—- 4-4 
Wiese (id Gautd2 ap45* p2d0 
a! a a a Sener | 


Gut (i2\) = (2,0): (1,1) (3,4) 


(5,3) (4,5) 


OHT(19) 


then, after Step 2 is completed, 


ALT(9,psc(1)) (3,4) ALP (9 jase (1) )) = (67220) (hig) 


(C,5,3) 


ALT(9,psc(2)) = (4,5) ALT (9,asc(2)) 
and, the field X contains the operation and QMSUS given as 
prime source units: 

y= Apps (474,55) 


The field X is used in Step 3 to form the hash source image. 
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Step 3. The hash address of the current action is 
determined; the prime sink unit is placed in the ALT; and, 
the UHA entry for the prime sink unit is updated to reflect 
its new index of definition. For the example given in Step 
2, the following values are defined in Step 3: 


B = OHT*ADD, (3,4;4,5),0! 


UHA (1,6) ) 


B 


UHA (2,6) 


ALT(9,psk) = (6,9) 


The primary function of Phase 1.1 is the retrievai, from 
the OHT, of Type C and Type R alternate source unit 
information. The variables of form still to be determined 
are Type A alternate source units, and Type P and Type S 
alternate sink unit references. When a unit is defined by an 
unmodified data-flow action, alternate sink unit references 
are easily identified by an examination of QMSUs which are 
eguivalent to the prime source unit given in the action. The 
next section describes Phase 1.2, whose major function is 
the identification of alternate sink unit references for an 


unmodified data-flow action. 
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Phase 1.2 determines if the current action, MO(i), is 
an unmodified data-flow action; if so, sk(i) is made 
equivalent to the prime source unit and all QMSUs equivalent 
to the prime source unit. Then, sk(i) is assigned as an 


alternate sink unit reference. 


Step 1. if the action is not an unmodified data-flow 
action, then Phase 1.2 is bypassed and Phase 1.3 is called; 
1.e., bEUeSk (Ee) <=>) sc(izt) by an unmodified data-flew 
action, Phase 1.2 is executed, otherwise Phase 1.3 is 


executed. 


Step 2. Since an unmodified data-flow action defines the 
Sink unit to be equivalent to the source unit, the sink unit 
must be made equivalent to the source unit and all 
previously defined QMSUs which are equivalent to the source 
unit. This is accomplished when the OHT address, 
UHA (2 ,s¢(7,1)), which corresponds to the source form of the 
action that defined (sc(i,1)) is placed in UHA(2,sk(i)); and 
then, (sk(i),i) is placed at all OHT addresses which contain 
SG(ip1)y “OHT (NOP; (sc(i,1) ,UHA(1,sc¢ (i, 1)))) 70) \-contains »the 
list of OHT addresses where sc(i,1) is contained. This step 
also constructs a vector, E, which contains all QMSUs, 
sc(i,1) is included, that are equivalent to sci: Wisk tbhis 


list is used in Step 3. 


As an example of the operations performed, if I(5) 5 
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the current action in the following program node (note that 


T(2), I(4), and I(5) are equivalent actions); 


Eth: SUB {4;7} {2} 
D (2): ADD (1923 {5} 
Bio: SUB {437} {8} 
Tr(a}: ADD {1; 8} {3} 
jes) GATE {53 {1} 


anger OUT ADD, (1,072,1) 0" = 23 


OHT'ADD, (1,0;8,3),0' = 36 


then OHT(23) = (5,2) (3,4) 


OHT (36) (ai jis na) 

OHT (NOP, (5,2).0) = 23 36 
Step 2 defines the following values, while LS) is 
processed: 


D = 23 36 


E = (374) (5-2) 


OnE (23) = (572) (344) (175) 
OfT (36) = (5,2) (3,4) (1,5) 
UHA(2,1) = 23 


Step 3. This step uses the vector E formed in Step 2 to 
identify those actions to which sk(i) is an alternate sink 
unit. Since sk(i) of an unmodified data-flow action is an 
alternate sink unit for sc(i,1) and all QMSUs equivalent to 
sc(i,1), sk(i) is an alternate sink unit for all actions 
referenced by QMSUs in E. After I(5), in the example given 
in Step 2, is operated upon in Step 3, the following 


alternate sink unit references are defined: 
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ALT (2,ask) 


(S74) (S45) 


ALT (4, ask) (P DAS 25) 


ALT (5,ask) 


(P,2) (P,4) 
The reference addresses for (sk(i),i) are also defined in 
Step 3. For the example in Step 2, 


OHT(NOP, (1,5) ,0) = 23 36 


If the current action is not an unmodified data-flow 
action, then alternate sink unit references are determined 
by another method. This method identifies all units defined 
by actions equivalent to the action currently under 
examination. The general technigue used in this method is to 
hash all possibie source unit combinations for the current 
action; thus, all units defined equivalently to the current 
action's sink unit are identified. Phase 1.3 is responsible 


for the implementation of this technique. 
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4.4.2.3 Pass 1, Phase 1.3 (Figure 4.3) 


— SS eS ee 


Phase 1.3 determines the hash address of each possible 
source unit combination in MO{i); identifies all previously 
defined QMSUS which are equivalent to sk(i); MO(i) is 
assigned aS an alternate sink unit reference at all 
equivalent predecessor actions; and, the OHT is updated to 


reflect the changes created by MO(i). 


Step 1. The hash address of each possible source unit 
combination is determined. This step constructs the vector D 
which contains a list of all hash addresses generated from 
(Op(i),sc(i),c(i)); and, the vector E which contains a list 
of all unique QMSUs stored at the addresses specified in D 


(i.e., a list of equivalent QMSUs). 


After step 1 has operated upon I(6) in the following 
program node (note that I1(2), I(4), and 1(6) are equivalent 


actions), 


I(1): SUB {23 1} {3} 
1 ta): ADD {334} {6} 
1 (3): GATE {3} {5} 
I (4): ADD {435} {8} 
T (5): GATE {4} i} 


T (6): ADD 15373 {3} 
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if OHT*ADD, (3,134,0),0" = 13 
OHT*ADD, (4,075,3) -0t = 26 
ONT ADD, (533775) 406 2= 152 
then D = 52 26 13 
E = (3,6) (6,2) (8,4) 
Step 2. The reference addresses in the OHT, 


OHT(NOP,E(j),0), are updated for all equivalent units 
identified in Step 1; also, previous actions to which 
(sk(i),1) is an alternate sink unit are identified. Since FE 
contains equivalent QMSUs defined at addresses specified in 
D, cveference addresses are updated by the assignment of D to 
OHT (NOP, E(j4) ,0) for each ee Wir Wipes sw at yp dl om 
E(t). =2 (ek @),2), and 2 (2), E(3)7s6<,E (NE) contains a) List or 
previously defined QMSUs which are equivalent to (sk (i) ,i); 
therefore, each action which defined E(j) is a fType P 
alternate sink unit reference for MO(i), and MO(i) is a 
Type S alternate sink unit reference for each action which 


defined E(j), j=2,3,---,nE. 


For the example given in Step 1, Step 2 generates the 


following values: 
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OHT(NOP, (3,6),0) = 13 26 52 
OHT (NOP, (8,4),0) = 13 26 52 
OHT (NOP, (6,2),0) = 13 26 52 


ALT (2,ask) (S,4) (S,6) 


ALT (4,ask) 


(P,2) (S,6) 


ALT (6,ask) fo 2 y (2, 4) 


Step 3. This step, the final one in Pass 1, updates the OHT 
to contain QMSUsS equivalently defined by the current action. 
Since D contains a list of all OHT addresses generated by 
the current action, and E contains all QMSUS equivalently 
defined by the current action, the OHT is updated by the 


placement of E in each OHT location specified by D. 


For the example given in Step 1, the following OHT 
values are defined in Step 3: 


OHT (13) = (3,6) (8/4) (6,2) 


OHT (26) (3,6) (864) (6,42) 


OHT (52) (3,6) (8,4) (6,2) 


Phase 1.2 and Phase 1.3 insure that all Type P and 
Type S alternate sink unit references are identified for 
each action. With alternate sink unit references determined, 
it is possible to identify Type A alternate source units. 
The next section describes Pass 2, which uses the alternate 
sink unit references to determine Type A aiternate source 


units. 
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Pass 2 is designed to identify all Type A alternate 
source units by an examination of Type P and Type S 
alternate sink units. Phase 2.1 examines the alternate sink 
unit specification in the ALT, in order to identify Type A 


alternate source units. 
4.4.3.1 Pass 2, Phase 2.1 (Figure 4.4) 


Phase 2.1, the final phase in the algorithm, examines 
each action represented in the ALT, from first to last, to 
determine if alternate sink unit references are present. If 
so, these references are used in the determination of Type A 


alternate source units. 


Step 1. Selects the alternate sink unit entries for each 
action (sender), and then determines the prime sink unit for 
the action (receiver) to which an alternate sink unit may be 


assigned. 


Step 2. Examines the ALT, in sequence, from the receiver to 
the last action represented in the ALT. If the prime sink 
unit for the receiver action is a prime or alternate (Type C 
or Type R) source unit, then the prime sink unit of the 


sender is identified as a Type A alternate sink unit. 


Each pass of the algorithm has been described in detail. 
The next major section offers proof that the algorithm 


identifies all alternate and equivalent action forms. 


en ried 7 aviees iy 
(aD ee thaw akg SR | -dintot 


a oo ol en | re 20504 3 3 gn? tou 


4 aT 1 ; - i Gant2 - aed s5ndd 


: . “Ph : 


gages i >? * Lan ded arnavwaberts nine 


> w J a 
iy gdot get i us. yt Sik Ge <3 note _ 
7 —- it 1806 staan 
i it i 

ca ay Si4u ‘as mm OQQt is sgudee 7 
. a —See 
iv, en ahi li Soule & Ah Gage Greer > (aeguse}@ 33 

i) 

So Von : ett »& Ascae ** ( sev he ee | —— 


1) 


>) Tod 7 ew ng =o et iVis@t rt arte. oT auetauen 


ss. j ba? if pal © iz ay imrnedeyes soi?*1b 9988 
Eay < : nay bc sehen oa? = 
a iine iii e _ 


oe ) a 


“ 1s2 
7s 


“bis os ot Goufatew 
RO fo 


hana i i ci ‘* 


om ii 


dd 


(ss epee erie raer | 
{Phase 2.1] — 
a aaa 174 
a 
\i 
ce | y 
{ i=0 | 7 sea ale 
ore {k=k+1{ 
OY KX 7 jm=O | 
{ Vv | j 
{ ct | i 
| {i=i+1] | y 
i fj=0 j i e 
{ Sn pa { * «6 
{ { { > e e < 
| y haan OTN e——__________, 
| 6 s r 
| md od 2 @ : 
j = e > c——4 e a 
{ ® 1:N #——->{END{ {m=n+1] 
| r © | Ce | ts 
j eS © | 
| e y 
i [s e 
1 i e«¢ 
i y @ e 
{ 4 7 > emie 
i igs el {1}<——-e nsc ® 
: ei bas oa e(i,1)¢ 
{ s e 
{ V ee 
{ e © 
I » ® i< 
{ * e i 
j @ ] Y 
a ai : 
te :nALT @ {|G=ALT(k,psc(m) ) | 
e(i,ask) ¢ {| «# ALT(k,asc(m)) | 
e e a 
e 3 { 
® @ y 
is ee 
| yes e © no rr 
Y ———————_*, peg *—___ 411 
r Seen A Hes 3 g & cc! 
{T=jth ordered pair} Y ee 
(eCing ALT fivask) a e 
| bD=ALT (i, psk) t>41{ [ALT (k,asc(m)) = | 
|p=ALT(T(2),psk)) | 4 { ALT(k,asc(m)) #1 
| k=T (2) | (i eb) eo tale 
(22222 ee oes 


Figure 4.4 Flow Chart for Pass 2, Phase 2.1: Type A 
Alternate Source Unit Identification 
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4.5 Alternate Unit identification Algorithm: Proof 


The general approach used to prove that the algorithn 
identifies all alternate units is to state and prove 
assertions which support the intention of each pass. For 
Pass 1, proof is given that all Type C and Type R alternate 
source units and all Type P and Type S alternate sink units 
are identified; for Pass 2, proof is given that all Type A 
alternate source units are identified. Before a proof is 
offered for each pass, three theorems are presented which 
are basic to the algorithm. Throughout the proofs, the 
acronym "UDF" is generally used in place of the more verbose 


“unmodified data-flow action". 


Theoren 4.1 Given N actions in a program node: If 


MO(i) <=> MO(3j) and i< j, then (sk(i),1) is defined at 


EOS ey) (Ope) rsc(j),o(G)) » ISi<N, I<9S%- 


Proof: 

Frompdet sai, for MOQ) and MO(3) # UDF, 
Ob fue 2 98OR (3), 1clije = €())y, Sand ise (sal) <=2asc (gan) 
frvecall, sc(i,1) references the Ist source unit 
combination at MO(i) ]; 
and, since equivalent units are transitive, 
sc(i) <=> sc(j)- 
Since i < j and equivalent units are qualified by their 


index of definition, sc(i) € sc(j)- 
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Tf se(i) € sc(j) and MO(i) <=> mo({j), 
then sc(i) gd sc({j) defines an additional set of source 
unit combinations which identify action forms that may 
define (sk(i),i). 
porseeicher (MOCijacor eNO) = UDF, (sk (4),1) 4s obviously 
defined at MO(j); therefore, where MO(i) <=> MO(j) and 
i<j, (sk(i)-i) is defined at MO(j) by (Op(j),sc(j)sc(j)), 
TIN 1S SNS 


Q.E.D. 


Theorem 4.2 Given N actions in a program node: If ail 
Type P alternate sink units are identified at MO(j), then 
all Type S alternate sink units may be identified at MO(i), 


fad eee ON, Ht 2, <<a pN- 1. 


PEOofs 
From def 4.4, MO(j) is a Type P alternate sink unit 
reference at MO(i) 
Prt a | and 
MO(i) <=> MO(j); 
feomw oder V4H55, MO(j) is a Zype S alternate Sink unit 
reference at MO(i) 
aff 1 3jeand 
MO(i) <=> MO(j). 


Q.E.D. 


Theoren 4.3 The identification of Type C and Type R 


eS eee ee 


alternate source units is sufficient to identify all Type P 


and Type S alternate sink unit references in a program node. 
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Proof: 


From def 4.1 and 4.2, it can be seen that all Type C and 
Type R alternate source units are identified only if all 
equivalent predecessor actions are identified. From 
Theorem 4.2, the identification of all equivalent 
predecessor actions is sufficient to identify all equivalent 
actions; therefore, the identification of all Type Cc and 
Type R alternate source units is sufficient to identify ail 
Type P and Type S alternate sink unit references ina 
program node. 


Q.E.D. 


Corollary 4.3 The identification of all Type C and Type R 


alternate source units is sufficient to identify all 


equivalent actions in a program node. 


PEoof: 

Clearly, Type A alternate source units cannot form new 
action eguivalencies; therefore, Corollary 4.3 is immediate 
from Theorems 4.2 and 4.3. 


QED. 


The theorems presented are used throughout the proofs 
given in the next two sections. The following section offers 
proof that Pass 1 of the algorithm identifies all Type C and 
Type R alternate source units and all Type P and Type S 


alternate sink units. 
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To show that Pass 1 identifies all Type C and Type R 
alternate source units and all Type P and Type S alternate 
Sink units, the following assertions are made (assume N 


actions in a program node and NS memory sub-units). 


Assertion 4.1 Each possible and different form for MO(i) is 


uniquely identified, i=1,2,...,N. 


Proofs 

The hashing function is applied to each (Op(i),sc{i,j) ,c (i)) 
for j=1,2,...,nsc(i) [nsc(i) means number of source unit 
combinations for MO(i)]; and, where commutative laws apply 
to Op({i), there is a defined order of representation for 
sc(i,j). Therefore, since each unique (0Op(i),sc(i,j),c(i)) 
hashes to a unique address, each possible and different form 
of each MO(i) is uniquely identified by its corresponding 
OHT address. 


Q.E.D. 


Assertion 4.2 If MO(i) redefines unit {k}, all references 
to previous definitions of unit {k} are differentiated from 
all references to the MO(i) definition of unit {k}, IS1i<N, 


1<k<NS. 
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Proof: 

Since each unit is augmented by its index of definition, 
references to the MO(i) definition of unit {k} are distinct 
from references to previous definitions of unit {k}; ise., 
the MO(i) definition of unit {k} is referenced as (k,i). 


Q.E.D. 


Assertion 4.3 UHA(1,k) contains the most recent index of 


definition for unit {k}, 1<k<NS. 


Proof: 
Actions are examined in first to last sequence within the 
program node and all units have an index of definition = 0 
upon entry into the program node. 
Roret=d, 2,a5-, N 

UHA(1,sk(i)) := i; 
therefore, UHA (1,k) always contains the most recent 


(current) index of definition for unit {k}, 1<k<NS. 


O. Bs Da 
Assertion 4.4 For the examination of MO (t) where 
MO(t) = unmodified data-flow action, the generation of 


entries at OHT'Op(t),sc(t),0' is unnecessary, t=1,2,...,N. 
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Proof: (assume sc(t,1) = {e}) 
(recall, OHT(i) means all elements at OHT address generated 
by hashing i; OHT'i' means OHT address generated by hashing 
ie OHT[ i]. neans  elenentsegat ¢sOHTs,address sis is means 
concatenation; and, (x # y) means all x except y] 
When MO(t) = UDF and sc(t,1) is represented as (e,UHA(1,e)): 
D := OHT (NOP, (e,UHA(1,e)) ,0) 
where OHT(NOP, (e,UHA(1,e)),0) contains a complete list 
of OHT addresses that correspond to action forms which 
may define (e,UHA(1,e)). 
FOr mw=1,2,-<. ,uD 
OHT[D(m) ] := OHT[D(m) ] » (sk(t),t); 
thus, each OHT entry which corresponds to an action form 
that may define (e,UHA(1,e)) is updated to reflect the 
eguivalent definition of (sk(t),t). 
Gum(NOR, (sk (t\-, £)7 0h =10;3 
thus, OHT(NOP,(sk(t),t),0) contains a complete list of 
OHT addresses which correspond to action forms that may 
define (e,UHA(1,e)) and (sk(t),t). 
UHA (2, sk (t))) s=°UHA (2,7); 
thus, OHT[UHA(2,sk(t)) ] contains a complete last vot 
QMSUs which are equivalent to (e,UHA(1,e)) and 
(sk(t),t)- 
If OHT(NOP, (e,UHA(1,e)),0) contains a complete list of OHT 
addresses that correspond to action forms which may define 
(e,UHA(1,e)); and, if OHT[UHA(2,e) ] contains a complete list 


of QMSUsS which are equivalent to (e, UHA(1,e)), then 
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(sk(t),t) is made equivalent to (e,UHA(1,e)) and all QMSUs 
equivalent to (e,UHA(1,e)). Therefore, for the examination 
of MO(t) where MO(t) = UDF, the generation of entries at 
OHT'OP (t),sc(t),0' is not required for the identification of 


equivalencies, t=1,2,...,N. 


Assertion 4.5 For the examination of MO (t) where 
MO(t) # unmodified data-flow action, OHT (NOP, (3,3) ,9) 


contains a complete list of OHT addresses that correspond to 
action forms which may define (i,j), 1Si<NS, 1<jsN, 


5h 5og See rear 


BEeot: 
When (i,j) is defined at MO(j) and MO(j) # UDF: 
ONT (NOP, (1,74) ,+0) = D 
where D := D w OHT'Op(j),sc(j,m),c(3)' 
OG Bole Z gee sy SC (5) 
Thus, for the examination of MO(t) where t = j, 
OHT(NOP, (i,j) ,0) is complete, 1<i<NS, 1S5j<N. 
When (k,s) is defined at MO(s), s > j, MO(s) <=> MO(j), and 
MO(s) # UDF, 1SK<NS: 
OHT (NOP, (1,4) ,0) := D 
where D := D w» OHT'Op(s) ,sc(s,/m),c(s) ' 
£00 M=1927 eu ,USC (5S) « 
From Theorem 4.1, sc(j) € sc(s), and sc(j) A sc(s) 
defines an additional set of source unit combinations 


Which identify action forms that may define. “(1 74); 
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therefore, OHT(NOP, (i,j),0) © D. {(D # OHT(NOP, (i, 73) ,0)) 
represents all OHT addresses which correspond to action 
forms identified by (9p(s),sc(j) & sc(s),c(s))- From 
Assertion 4.3, no OHT addresses are generated when 
MO(i) = UDF; therefore, for an examination of MO(t) 
WRETe UPL =jt iy jh 27s. 7 Neo OHT (NOP, (i,j),0) is complete, 
PSTN ta yf ie toys ee Ne 
Cieapty ,PULoLi,the, examination ‘of “MO(E)” forage ty and 
t=j+1,j+2,..-,N, OHT(NOP, (i,j),0) is complete; therefore, 
for the eXamination of MO (t) where MO(t) # UDF, 
OHT (NOP, (i,j),9) contains a complete list of OHT addresses 
that correspond to action forms which may define Gp). 
ISI SNAP ISqSNpet=jej+4h. ep he 


Q.E.D. 


Assertion 4.6 For the examination of MO(t) where (i,j) is 
defined at MO(j) and MO(j) = unmodified data-flow action, 
OHT(NOP, (i,j) ,9) contains a complete list of OHT addresses 
which correspond to action forms that may define (2 


Proof : 

When MO(s) <=> MO(j), Ss > j- MO(j) = UDF, and MO(s) # UDF: 
OHT(NOP,E(j),0) := D 
where D := D » OHT'Op(s),sc (s,m),c(s)' 
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and, E := E » (OHT[ D(m)] # BE) 
forgma(5 294 .o-nDs 
EOL Stag piece pies 
From Assertion 4.3, (Si (ale, pocee : therefore, for 
HO (Ss) <=> NO(4) s -S*>"j,. MO(j).= UDF, and MO(s) ¥ UDF, 


OHT (NOP, (i,j) ,0) is complete. 


When (i,j) is defined at MO(j) and MO(j) = UDF: 


For 


and 


of 


OHT (NOP, (4735) ,9) == CHT (NOP, (sc (j,.1),0HA(1,se(i, 1) 5 70) 
Thus, for the examination of MO(j) where MO(j) = UDF, 
the completeness of OHT(NOP,(i,j),9) is dependent upon 
the completeness of OHT (NOP, (sc(j,1),UHA(1,Ssc(j,1))),90)- 
It can be seen that convergence wili occur (i.e., 
completeness can be proven) if the extreme dependency 
case is considered (i.e., a program node which contains 
only unmodified data-flow actions). For the extreme 
case, there exists a (z,0) which is defined at MO(0) 
and, by the initialization in Phase 1.1, MO(0) # UDF; 
therefore, OHT(NOP,(z,0),0) can be proven to contain a 
complete list of OHT addresses that correspond to action 
forms which may define (z,0). 

the examination of MO(t) where (i,j) is defined at MO(j) 
MO(j) = UDF, OHT (NOP, (i,j),0) contains a complete list 


OHT addresses which correspond to action forms that may 


define (i,j), t = j and t=j+1,j+2,...,N, 1SiSNS, OSjSN. 


Q.£.D. 
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Corollary 4.6A(1) From Assertions 4.5 and 4.6: 

For the examination of MO(t) where (i,j) is defined at 
MO(j)/ OHT (NOP, (i,j),0) contains a complete list of OHT 
addresses that correspond to action forms which may define 


(leis (SNS Bl So ee P+ 1 2s wns 


Corollary 4.6A(2) From Assertions 4.5 and 4.6: 
For the examination of MO(t) where (i,j) is defined at 
MO(j), OHT(NOP, (i,j),0) contains no OHT addresses that 


correspond to action forms which may not define (i,j), 


Fotons is iche t=4, jt1os-., Ne 


Assertions 442i) For the examination. of, NO (e) zon(i/j)M us 


contained at all OHT addresses that correspond to action 


FOrnsawhteh may ideLine (1.74), tSiSNS, OS51<5N, t=j,j*l,. <6 Ne 


Proots 
When (i,j) is defined at MO(j) and MO(j) # UDF: 
OHT (OP (3) ,Ssc(j,m),c(j)) 2= OHT(Op(j),Sc(5,m) ,c(5)) # (1,5) 
POLMUH Wt yoy ciecig i SC (5). 5 
thus, for the examination of MO(t) where t = j and 
MO{j) # UDF, (i-j) is contained at all OHT addresses 
that correspond to actions which may define (i,j), 


1<i<NS, OSj<N. 
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When (kys) is defined at sMO(s) , MO(j) <=> HO(s),%s°> 4, and 
MO(s) # UDF: 

Foran= (72,2. ,usc(s) 

CHT (Op(sS),sc(S,m),c(S)) := E 
where EF := E aw {OHT(Op(s),sc(s,a),c(s)) # E) 
PORN AU giee'at gp SCS). 

From Theorem 4.1, sc(j) € sc(s), and sc({j) «a sc(s) 

defines an additional set of source unit combinations 

which identify action forms that may define (i,j); and, 

Since E contains all QMSUs at OHT(Op(s),sc(s),c(s)), 

(i,j) € E. Therefore, for an examination of MO(t) where 

t=ghlpit2scoa,N ©. and’ MOG) UDF, (277) is contained at 

all OHT addresses that correspond to action forms which 

may define (i,j), 1Si<NS, OSjSN. 
From Assertions 4.4 and 4.6, it can be seen that for ({i,j) 
defined at MO(t) where MO(t) = UDF, (i,j) iS contained at 
all OHT addresses that correspond to action forms which may 
define (i,j), t=j,j+1,--.,N; therefore, for the examination 
of MO(t);, (79) “ts ‘contained. Jato all OHT addresses ‘that 
correspond to action forms which may define (i,j), 1Si<NS, 
US sNeatH titty s.s, Ne 


Q.E.D. 


Corollary 4.7A(1) From Assertions 4.4, 4.6, and 4.7: 
(i,j) is contained at no OHT address that corresponds to an 


action form which may not define (i,j),1SiSNS, OSJSN. 
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Corollary 4.7A(2) From Assertions 4.4, 4.6, and 4.7: 
OHT(a) Contains a complete list of QMSUs that may be defined 
by the action form to which OHT (a) corresponds, 1sa<nsc(i), 


Beis! 9 «6 sr¢ Na 


Assertion 4.8 OHT[{UHA(2,k) ] contains a compiete list of 


QMSUS which are equivalent to the current definition of unit 


{k}, 1Sk<NS. 


Proof: (assume UHA(1,k) = x) 
Where (k,x) is defined at MO(x) and MO(x) # UDF: 
OHM (2 Kyo = SOUT "Op (x), Se(xe1) cc (zj4a 
From-corollary 4.7 (2), GHT (Op (x) ¢sc(x,1) ,a(%)) contains 
a complete list of QMSUs that may be defined by the 
action form to which OH? (Opitx)i,.sc (x, 4), c an) 
corresponds; therefore, for MO(x) # UDF, OHT[ UHA(2,k) ] 
is complete, 1<k<NS. 
When (k,x) is defined at MO(x) and MO(x) = UDF: 
UAA (27K s— UHA(2Z,sc(x,1)) - 
From Assertions 4.4, 4i6.and Corollary 4.7A (2), it can 
be seen that for MO (x) = UDF, O#T[UHA (2,k) ] as complete, 
1<SkK<SNS. 
Since OHT[UHA(2,k)] is complete for MO(x) = UDF and for 
MO(x) # UDF, OHT[UHA(2,k) ] contains a complete list of QMSUs 
which are equivalent to the current definition of unit {k}, 
1<kK<NS. 


Q.E.D. 
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Assertion 4.9 For the examination of BO (By ji sall Fyps Ce xand 


Type R alternate source units may be identified, 


Ge caltal tesaa,¢ Ne 


Proof: 

From Assertion 4.8, all QMSUs eGare lene to bese (47174) ane 
contained at ONTRUEA(2,S¢ 7475) ],e) gete2pe-.pnse Gali 
adie gisiesi sin 


Q.E.D. 


To show that Pass 1 identifies all Type P and fype $ 


alternate sink units, the following assertions are made. 


Assertion 4.10 For the examination of MO(i), all Type P 
alternate sink unit references may be identified, 


RUG Z geverecg Ns 


Proof: 

From Assertion 4.9, it can be seen that all equivalent 
action forms may be identified; and, from 
Coroblaries 4.60 (2) and 4.7A (2), it can. be seen that the 
index of all equivalent predecessor actions ma y be 
identified. Therefore, for the examination of MO(i), all 
Type P alternate sink unit references may be identified, 
Red 2yee.a ps (Assertion 4.10 is also immediate from 
Theorem 4.3) 


Q.E.D. 
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Assertion 4.11 For the examination of mO(i), all Type Ss 


alternate sink unit references for MO(j) may be identified, 


a, 2 a BA me mee ee 


Proof: 

From Assertion 4.10 and Theorem 4.2, it can be seen that for 
the examination of MO(i), all Type S alternate sink unit 
references 1 He M0(j) can be identified, i1=2,3,..<,%; 
Se ae Real Be 


Q.E.D. 


Assertions 4.1 through 4.11 have been offered to show 
that Pass 1 identifies all Type C and Type R alternate 
source units and all Type P and Type S alternate sink units. 
The variables of form for actions are Type C, Type R, and 
Type A alternate source units and Type P and Type §S 
alternate sink units. The following section offers proof 
that Pass 2 of the algorithm identifies all Type A alternate 


source units. 


To show that Pass 2 identifies all Type A alternate 
source units, the following assertion is made (assume N 


actions in a program node and NS memory sub-units). 


Assertion 4.12 If all Type P and Type’S alternate sink 


units are identified at MO(i), then all Type A aiternate 


source units may be identified, i=1,2,...-,N- 
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Proof: 
Fuonedef 424 and 4:25; 
If QMSU (m,x) is an alternate sink unit for MO(k), then 
MO(x) <=> MO{k); 
and, if (t,s) is an alternate sink unit for MO(k), then 
MO(k) <=> MO(s). 
Therefore, sk(x) = m, MO(x) <=> MO(k), 
and MO(k) <=> MO(s); and, the conditions given in def 
4.3 are satisfied for 1Sm,t<NS, 1<x,k,s<N. 
From def 4.3, 4.4, 4.5, and Corollary 4.3, it can be seen 
that if all Type P and Type S alternate sink units are 
identified at MO(i), then all Type A alternate source units 
Gavevemidentitied, 1=1,2,< «<7, Ne 


Q.E.D. 


The alternate unit identification algorithm has been 
described in detail and an algorithm proof has been offered. 
The next major section describes the relationship between 
the ALT, constructed by the algorithm, and machine data~flow 


fan-out capabilities. 
4.6 ALT and Hardware Fan-out 


The algorithm presented in this Chapter identifies all 
equivalent actions. These equivalent actions are represented 
at AL@(i,;asky for i=1;2,...,N- It can be seen thet Whe 
MO(i) <=> MO(j), then sk(i) is a fan-out candidate at MO(j) 
and sk(j) is a fan-out candidate at MO(i) [52]; therefore, 


the information collected in ALT(i,ask) may be used to 
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delete actions by the exploitation of parallel hardware fan- 
out capabilities. Also, the identification of equivalent 
actions establishes the basis for alternate action forn 


identification. 


This Chapter presented and proved the correctness of an 
algorithm which identifies all possible forms of each action 
in a program node. The ALT has been structured to contain 
the information which defines alternate action forms. In 
order for the algorithm to construct the ALT, alternate 
units were qualified as to type, and a hashing technique was 


proposed. 


The algorithm described was coded using APL/360. The 
hashing function used for the program node given in 
Figure 3.1 was a residue modulo(213) arithmetic technique. 


There were 37 entries generated in the OHT (NS = 9). 


With reference to the solution approach described in 
Section 3.3, this Chapter has presented an algorithm for the 
determination of all possible forms, F(m), of action(m), by 
the identification of equivalent actions and equivalent 
Units, m=1,2,«.<,N. The next Chapter defines the minimal set 
of conditions required for an action to be a _ nonessential 


candidate for deletion. 
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CHAPTER V 


DELETION CANDIDACY 


5.1 Introduction 


a ee SS eee 


In Chapter 3, nonessential actions were defined in terms 
of particular states (0,122, mOre Ss) ot WRC ban Melersun ee 
assignment table (UA). In Chapter 4, an algorithm was 
presented which identifies all possible alternate action 
forms. The purpose of this Chapter is to describe a minimal 
set of conditions, within alternate forms of the program 
node, which must exist in order to delete an action as 
nonessential; i.e., alternate action forms are used to 


identify candidates for deletion. 


The general concept displayed in this Chapter is as 
follows: Through the assignment of alternate source and sink 
units, UA states are determined which represent nonessential 
unit definitions. Section 5.2 describes the conditions 
required to change a particular state in the UA. fhen, 
Section 5.3 shows now an action is deleted by the assignment 
of alternate source and sink units; i.e., conditions 
reguired for changing a set of states in the UA are defined. 
Included in Section 5.3, proofs for the sufficiency of 


conditions are offered. 


It was found that order-isomorphisms may exist in 


deletion conditions. Section 5.4 describes the cases where 
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these may occur and presents a basic theorem for the 


analysis of reciprocally dependent deletion conditions. 


A Simple example of deletion candidates is presented in 
Section 5.5. The example illustrates some of the concepts 
developed in this Chapter. From the deletion conditions 
described, it is possible to define deletion strategies for 
nonessential actions. These strategies will be discussed in 


Chapter 6. 
5.2 Conditions Required 


The purpose of this section is to describe those 
conditions which must exist, in order to change a_e state 
within the unit assignment table. In particular, states are 
to be changed so as to create a nonessential unit definition 


in the program node. 


From an examination of the nonessential action 
definitions given in Section 3.3, it can be seen that the 
following state changes in the UA may be required for the 
deletion of an action. 

1. State 1 changed to state 0: The deletion of a unit 
from its participation as a source unit for an action. 

2. State 3 changed to state 2: Same as 1, above. 

3. State 2 changed to state 0: The deletion of a unit 
from its participation as a sink unit for an action. 


4. State 3 changed to state 1: Same as 3, above. 
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Those conditions which are required to make state 
changes 1 to 0 and 3 to 2 are as follows: The existence of a 
feasible alternate source unit; or, the deletion of an 
action from the program node. Those conditions which are 
necessary to make state changes 2 to 0 and 3 to 1 are as 
follows: The movement of a unit definition to another 
action; or, the deletion of an action from the program node. 
Section 5.3 defines the conditions required for an action to 
be deleted. The conditions of feasibility for an alternate 


source unit are described next. 


5.2.1 Alternate Source Unit Assignment Feasibility 


An alternate source unit, ALT (i,asc(j,k)), is feasible 
for assignment oniy if conditions exist which insure that 
the unit is equivalent at MO(i) to the unit in 
ALT G ,psec(j)), Sis, 1<4<use(i,1), 1<k<nALT(i,asc(35)). 


These required conditions are given in the definition below. 


def. 5.1 Feasible alternate source unit: Unit {i} is a 
feasible alternate source unit for unit {t} at MO(x), 1SxSN, 
if, and only if, one of the following conditions is 


satisfied. [Assume unit {t} is defined at MO(s) ] 


Ea (t,;s) € ALT (x,psc(a)), OSsSN-1, tSasnse (x7 1)5 


(C;inj) € ALT (x;asc(a)), 0S3SN-1. 


Za (t,s) € ALT (x,psc(a)), OSSSN-1, 1<as<nsc(x,1);3 
(R,i, ji) € ALT(x,asc(a)), O<jSN-2; 
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where UA(m,sk(j)) > 1, 

MO (m) is a negated-forward candidate for 
deletion; or, MO(m) is a parallel-forward 
Candidate for deletion with respect to MO(y), 


such that y 2 x. 


ae (t,s) € ALT(x,psc(a)), OSs<N-1, 1S5a<nsc(x,1); 
(A,i,j,k) € ALT(x,asc{a)), 1<j<SN-1, 15k<N-1; 
MO(k) iS a parallel candidate for deletion with 
Pespecerco. MO (5) 3 
EGE sRSjtl jt Ap. pe— 1 
Where UA(m,sk (k)) > 1, 
MO (m) is a negated-forward candidate for 
deletion; or, MO(m) is a parallel-forward 
candidate for deletion with respect to MO(y), 


such that y 2 x. 


The conditions for Type C, Type R, and Type A alternate 
source unit feasibility have been presented. The conditions 
required for the movement of a unit definition to another 
action are given in the section which describes parallel- 
forward and parallel-backward deletion conditions. Before 
the conditions for deletion candidacy are defined, it is 
necessary to describe condition conflicts; i.e., those cases 
where deletion conditions for an action may violate other 


deletion conditions for the same action. 
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9.2.2 Condition Conflicts 


A basic requirement for the deletion of an action is 
that conditions for its deletion may not conflict with each 
other. For example, assume that MO(x) is examined as a 
posSible negated-forward candidate for deletion. Further 
assume that its deletion requires UA(i,sk(x)) be changed 
from state 1 to state 0, and UA(j,sk(x)) be changed from 
State 3 to state 2. if the condition used to change the 
state in UA(i,sk(x)) is the assignment of alternate source 
unit (t,s), and the condition used to change the state in 
UA(j,Ssk(x)) requires the deletion of MO(S); then, the first 
condition reguires the definition of unit {t} at MO(s), and 
the second condition requires the deletion of MO (Ss). 


Clearly, these two conditions are in conflict. 


Simply stated, conditions are in conflict with each 
other, if each condition requires a different form of the 
same action (a deleted action is considered the null form of 
the action). It is important to note that conditions which 
define alternate source unit feasibility are also considered 


in the identification of conflicting conditions. 


In order to identify a deletable action, it is necessary 
to define a minimal set of conditions for its deletion 
candidacy. Furthermore, these conditions must not conflict 


with each other. The next section describes the conditions 


for deletion candidacy. 
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5.3 Conditions for Deletion Candidacy 


SS a 
— SSS SS = 


The purpose of this section is to define a minimal set 
of conditions which must exist, in order for an action to be 
deleted. Conditions for deletion candidacy are based upon an 
examination of each action independently; i.e., examination 
is given to each action as if it is the only action to be 
deleted from the program node. Of course, the deletion of 
one action iS a dependent event, as it may affect the 
deletion status of other actions. The analysis of deletion 
dependencies among different deletion candidates is deferred 


to Chapter 6. 


Along with each set of deletion conditions, a proof is 
offered. The proof shows’ that the conditions, when 
Satisfied, are sufficient to identify all actions which 
conform to the nonessential action definitions given in 
Section 3.3. Before conditions and proofs are given, two 
original and basic theorems are presented. The intent of 
these theorems is to show the range of actions over which 


deletion examination is to be given. 
5.3.1 Deletion Range Theorems 


Theorem 5.1 Given a program node: An examination of all 


equivalent actions is sufficient to identify all parallel 


and redundant actions. 
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Pxoot: 

From def 3.10 and 3.11, only equivalent actions may be 
parallel; and, from Theorem 3.1, the set of redundant 
actions is included in the set of parallel actions. 
Therefore, an examination of equivalent actions is 
sufficient to identify all parallel and redundant actions. 


Q.E.D. 


Theorem 5.2 Given a program node and NS memory sub- 


units: An examination of adjacent definitions of unit {k} 


is sufficient to identify all negated definitions of unit 


{k}, 1<k<NS. 


ERoGi < 
An examination of def 3.14 shows that MO(i) is negated- 
forward with respect to MO(j) 
Pet Ske = ski): 
pe ag Es 
j 
2 UA(m,Sk(i)) = 2. 
m=i+1 
If MO(j) is negated-forward with respect to MO(t), then the 
MO (j) definition ort sk (3) may be deleted; thus, 
UA (j,sk(j)) = 0. If MO(i) is negated-forward with respect to 


MO(j),- and if MO(j) is negated-forward with respect to 


MO(t), then 
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i.< Fo 
Sk(i) = sk(t); and 
ic 
x UA (GU Sk GL) = 2. 
m=i+1 
Therefore, an examination of adjacent unit definitions is 
sufficient to identify all negated unit definitions. 


Ocrie Ds 


it is intended that examination be given to each action 
in a first to last sequence within the program node. 
However, this order is not essential, since actions are 
examined independently. Deletion conditions are defined 
first, for parallel-forward candidacy; second, for parallel- 
backward candidacy; and third, fOr negated-forward 


candidacy. 


5.3.2 Parallel-Forward Deletion Candidacy 


MO(i) is a paralleli-forward candidate for deletion wita 
respect to MO(j) if, and only if, the following minimal set 
of conditions are satisfied and no condition conflicts 


occur, 1Si1SN-1, 1<jSN. 
Toe(S, jhe Auta, ask). 


2. Either sk(i) may be fanned-out at MO(j), 
or sk{i) = sk{j5), 
or MO(j) is a candidate for deletion and sk(i) 


can be used as a sink unit for Op(j)- 
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where S(sk(1) 2), "€ ALT (@ psc (r)), 1<r<nsc(m,1 

at least one of the following condition 

satisfied: 

a) There exists a feasible alternate 
ALT (m,asc (Lr) ) 3 

b) MO(m) is as- negated-forward candidate 
deletion; 

c) MO(m) is a parallel-forward candidate 
deletion; 

qd) MO(m) is a parallel-backward candidate 


deletion. 


HE eat ot eer Ge 

where (sk(i),i) € ALT (m,psk), 

at least one of the foilowing conditions 

satisfied: 

a) MO(m) is a negated-forward candidate 
deletion; 

b) MO(m) is a parallei-forward candidate 
deletion with respect to MO(s), such 
Sez 3% 

c) MO(m) is a parallel-backward candidate 
deletion with respect to MO(s), such 
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Proof of sufficiency: 
Condition 1 states that (S,j) € ALT(i,ask). Therefore, from 
def. 4.5 and the alternate unit identification algorithm in 


Chapters4, MO(1)> <=> NO(j) and a < 4. 


Condition 2 insures that sk(i) may be fanned-out at MO(j), 
Or sk(i) may replace sk{j) at MO{j). If sk(i) = sk(4), it is 
obvious that sk (i) hay replace’ ’sk({j) Sateeno(s) .SaLe 
Sk(i) # sk(j), and sk(i) may not be fanned-out at MO(j), 
then sk(i) may replace sk(j) at MO(j) if, and only if, MO(j) 
is a deletion candidate. Also, sk(i) must be viable as a 
Sink unit for Op(j). The viability requirement is necessary 
for processors whose "data-flow" sink units may differ from 


arithmetic sink units. 


Condition 3 insures that if sk({i) is a prime source unit at 

MO(m) where i<m< j, it may be replaced by an alternate 

source unit; or, MO(m) may be deleted. If the alternate 

assignments are made at MO(m), or if MO{m) is deleted; then, 
OD oil tly. cle 


UA(m,sk({i)) = 0]2. 


Condition 4 states that where sk(i) is a prime sink unit at 

MO(m) for i<am< j, the MO(n) definition of sk(i) may be 

deleted or moved outside the range of deletion; therefore, 
CO Maat tpt eS patars gel | 


UA(m,sk(i)) < 1. 
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Conditions se vandeeconditione 4s together, yield the following 


parallel-forward deletion requirements: 
471 
yy UA(m,sk(i)) = 0; 
m=it1 


UA (j,sk G) ) =0 p22: 


Q.E.D. 


From Theorem 5.1, if there are t Type S alternate sink 
unit references in ALT(i,ask), then MO(i) is examined a 
maximum of t times for its deletion candidacy as a parallei- 
forward action. If MO(i) <=> MO(j) <=> MO(k), i < j < k, and 
MO(i) is not a parallel-forward candidate for deletion with 
respect to MO{j), then MO(i) cannot be a parallel-forward 
candidate for deletion with respect to MO(k). An exception 
to this is as follows: The only required and unsatisfied 
condition is that sk(i) replace sk(j).- Conditions for 


parallel-backward candidacy are presented next. 


MO(j) is a parallel-backward candidate for deletion with 
respect to MO(i) if, and only if, the following minimal set 
of conditions is satisfied and no condition conflicts occur, 


1<i<N-1, 1<j<N. 


Ue (P,i) € ALT (j,ask). 
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Either sk(j) may be fanned-out at HO (ay, 
or SK(G) =ésk (2); 
or MO(i) is a candidate for deletion and sk(j) 


can be used as a sink unit for Op(i). 


POr mati ,dt2,..-,) [a = i for sk) ¥ skys a2 = 9k 
of first, PUA (K sk (j)) > 1 (Ra tg eraser) for 
sk(j) = sk(i) ], 
where (sk(j),j) € ALT(m,psc(r)), 1SrSnsc(m,1), 
at least one of the following conditions is 
satisfied: 
a) There exists a feasible . alternate = 8 i 
ALT (M,asc (r));3 
b) MO(m) is a negated-forward candidate for 
deletion; 
c) MO{m) is a parallel-forward candidate for 
deletion; 
d) MO(m) is a parallel-backward candidate for 


deletion. 


POU Bait belt 72 yeas pt 
where (sk(j),j) € ALT(m,psk), 
at least one of the foilowing conditions is 
satisfied: 
a) MO(m) ais a negated-forward candidate for 


deletion; 
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b) MO(m) is a parallel-forward candidate for 
deletion with respect to MO(s), such that 
s2j); 

c) MO(m) 1S a parallel-backward candidate for 
deletion with respect to MO(s), such that 


ay MS a 


Proof of sufficiency: 

Condition 1 -and Condition 2, ‘as with parallel-forward 
actions, insure that MO({i) <=> MO(j), i < j; and, either 
sk(j) may be fanned-out at MO(i), or sk(j) may replace sk(i) 


at MO(i). 


Condition 3 for sk({j) # sk(i) insures that if sk(j) is a 
prime source unit at MO(m) where i<m < j, it may be 
replaced by an alternate source unit; or, MO{m) may be 
deleted. Thus, 

POR MSL yite 746s 7] 


UA(m,sk(4)) = 012. 


Condition 3 for sk(j) = sk({i) insures that if the MO(a) 
definition of sk(j) is used as a prime source unit at MO(m) 
where (il<-a°< mw < j, ‘it may be ‘replaced by an aiternate 


source unit; or, MO(m) may be deleted. Thus, for a= k of 


thepfirst Ue (kpsk.(4)) > 01) (k=Be 1/42). 0665) % 
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LOE =i 1, PO! , 5 5aca—1 
PA (n -SK (iy Sr: 
FOE a=at1  at2 ps. </e04 
UA (m,Sk(j)) = 012; 


UA (a,sk(j)) = 1143. 


Condition 4 states that where sk(j) is a prime sink unit at 

MO(m) for i<m< j, the MO(m) definition of sk(j) may be 

deleted or moved outside the range of deletion. Therefore, 
SOr B=LEI tt ap das 3-1 


WA (a -ok Cj), = 1. 


Condition 3 and Condition 4, together, yield the following 


parallel-backward deletion requirement for sk(j) # sk{i): 


J 
z UA(m,SkK(j)) = 2; 
m=i+1 
and, for sk(j) = sk (i): 


Cpt oe te i a en el 
UA (m,sk(j)) <= 1. 


Q.E.D. 


From Theorem 5.1, if there are t Type P alternate sink 
unit references in ALT(i,ask), then MO(i) is examined a 
maximum of t times for its deletion candidacy as a paraiiel- 
backward action. If MO(i) <=> MO(j) <=> MO(k), i> J > k, 
and MO(i) is not a parallel-backward candidate for deletion 
with respect to MO(j), then MO(i) cannot be a paraliel- 
backward candidate for deletion with respect to MO(k). An 


exception to this is as follows: The only required and 
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unsatisfied condition is that Sk(i) replace sk(j).- 


Conditions for negated-forward candidacy are presented next. 


a a sw se ee 


MO(i) is a negated-forward candidate for deletion with 
respect to MO(j) if the following minimal set of conditions 
Usa Satrtstiedspand) nog -condation «conflicts occur, 1Si<N, 


A<jsNet. 
To Ski() = sk (4) < 
o i < ae 


Se nOnen=it tert 2p ae 

where sk(i) € ALT (m,psc(r)), 1Sr<nsc(m,1), 

at least one of the following conditions is 

satisfied: 

a) There exists a feasible alternate in 
ALT (a; ase (2)))5 

b) MO(m) is a negated-forward candidate for 
deletion; 

c) MO(m) is a parallel-forward candidate for 
deletion; 


d) MO(m) is a parallel-backward candidate for 


deletion. 
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Proof of sufficiency: 
Condition 3 states that if sk(i) is a prime source unit at 
MO{m) where i<m <j, it may be replaced by an alternate 
source unit; or, MO(m) may be deleted. Therefore, 
LOY M=iP tito cans 
UA(m,sk(i)) = 042; 
and, from Theorem 5.2 and Condition 3, 
J 
x VAdm 7 SkK.(2)- t= 26 
m=1i+1 


Q.E.D. 


From Theorem 5.2, the number of actions to be evaluated 
for negated-forward deletion candidacy can be determined by 
an examination of adjacent unit definitions in the UA. 
Simply stated, the number of actions evaluated equals the 
number of memory sub-unit redefinitions. This can be 
determined by an examination of UAC 2a ee le LoL 
t= Wepewep No, while j=1,2,<.<,;Ntl. Note that the Taigurative 
redefinition of units at UA(i,N+1) is included in the 


examination. 


As stated previously, the conditions for deletion 
candidacy may contain conflicts. It was also found that 
conditions may contain order-isomorphisms. The next section 
will describe how this may occur, and will pose an original 
theorem which provides a basis for the analysis of order- 


isomorphic deletion conditions. 
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The recursively defined conditions in the last section 
were proven to be sufficient for the identification of all 
possible deletion candidates. However, it can be shown that 
the conditions may never converge; i.e., analysis of 
deletion candidacy for an action may never terminate. This 
problem, called reciprocal dependency, is defined and 
examples are given; then, a theorem is posed which allows an 


analysis of reciprocal dependencies. 


def. 5.2 Reciprocal Dependency: If during the analysis of 
MO(i) as a deletion candidate, it is found that MO(j) must 
be a deletion candidate; and then, during the analysis of 
MO(j) as a deletion candidate, it is found that MO(i) must 
be a deletion candidate, then MO({i) and MO(j) are 
reciprocally dependent with respect to deletion conditions. 
A more succinct definition is as follows: An ordered subset 
of deletion conditions for MO(i) is order-isomorphic with 
respect to an ordered subset of deletion conditions for 
MO(j), 15i,j<N. For a general description of or der- 


isomorphism, see Theorem 2.12 in [8]. 


To illustrate reciprocal dependency, the following 


program node is presented. 


Bed): ADD {1; 3} t33 
I (2): SUB {1; 4} {5} 
Gisi-: ADD {1; 3} {2} 


I(4): SUB {1; 2} {3} 
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SE Unte Sf2)) Lsicnot® bwsyea sens .exit (UA. (5,2) “]=) 2p 7) ythen 
examination is given to MO(3) as a negated-forward candidate 
for deletion. If M0(3) is to be a negated-forward action, 
then at least one of the following conditions must exist: 
a) (2,3) is replaced as a prime source unit at MO(4); 
b) MO(4) is negated-forward; 


c) MO(4) definition of unit {3} is moved. 


An examination of the above program node shows that the 
MO(4) definition of unit {3} may be moved to MO(2); however, 
from Condition 3 for parallel-backward deletion, this 
movement requires the deletion of MO(3). Therefore, MO(3) 
and MO(4) are reciprocally dependent with respect to 
deletion conditions. A less elaborate example of reciprocal 
dependency is MO({1) and MO(2) in the foliowing program node. 
{assume fan-out not possible) 

I (4): ADD ee {3} 
E (2) ADD 1423 {oe 
A theorem is presented next which is basic to the analysis 


of reciprocal dependencies. 


Reciprocally dependent deletion conditions would present 
an unsolvable problem, if they were randomly isomorphic. 
However, ite'is’ possible® to)cideatify these conditions; 
furthermore, these conditions need not impede the finite 
analysis of deletion candidates. It is clear from defie5s2 


that reciprocally dependent conditions may be identified. 
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The following theorem establishes a basis which allows a 
finite deletion candidacy analysis to be performed, even 


though reciprocal dependencies may occur. 


Theorem 5.3 If MO(i) and MO(j) are reciprocally dependent 
with respect to deletion conditions, and if MO(j) satisfies 
all deletion conditions in a minimal set, except the 
condition which requires the deletion of MO(i); then, the 
deletion condition which requires the deletion of MO(j) for 


the deletion candidacy of MO(i) is satisfied, 15i,j<N. 


Proots 

Let C(I) and C(J) represent the minimal set of conditions 
for the deletion candidacy of MO({i) and MO(j), respectively. 
Also, let c(i) represent the condition which reguires the 
deletion of MO(i), and let c(j) represent the condition 
which requires the deletion of MO(j).- From def 5.2, 
C(t) PeeGid); cand c(gytéac tr). 

If all C(J) are satisfied, except c(i), then c(j) has been 
reduced to Cia yes This reduction yields the trivial 
statement: The deletion of MO(i) requires the deletion of 
MO (i) fherefore; c(j) is satisfied. 


Q.E.D. 


Theorem 5.3 is presented as proof that the deletion 
candidacy analysis will converge; i.-e., all candidates for 
deletion can be identified by a finite analysis. Before 


consideration is given to deletion dependencies among 
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actions, the deletion conditions are applied to a progran 


node example. 
9-5 Deletion Candidates: Example 


An example is now presented to demonstrate the 
application of deletion conditions to a program node. The 
program node selected is given in Figure 3.1, with the unit 
assignment table given in Table 3.1, and the alternate unit 


assignment table given in Table 4.1. 


Before conditions are applied, examination is given to 
the UA, in order to determine those actions which are to be 
tested for negated-forward deletion candidacy. Aliso, in 
order to determine actions which are to be tested for 
parallel deletion candidacy, examination is given to 


ALS ask (4) pete; 27-s «, NADT (ipask); °i=1,27shaene 


Based upon Theorem 5.2, an examination of the UA given 
in Table 3.1 yields the following actions which are to be 
tested for negated-forward deletion candidacy. (MNO (N+1) 
represents the "busy on exit" status of a memory sub-unit) 

MO(4) with respect to MO(N+1) 

MO(6) with respect to MO (N+1) 

MO({1) with respect to MO (8) 

MO(2) with respect to MO(N+1) 

MO(3) with respect to MO(N+1) 


MO(7) with respect to MO(N+1) 
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Based upon Theorem 5.1, an examination of the ALT (i, ask) 
entries yields the following actions to be tested for 
parallel-forward deletion Candadacy , mad, 2 else 

MO(3) with respect to MO (6) 

MO (5) with respect to M0O(7) 

Also, based upon Theorem 5.1, an examination of the 
ALT(i,ask) entries yields the following actions to be tested 
for parallel-backward deletion candidacy. 

MO(6) with respect to MO(3) 


MO(7) with respect to MO(5) 


The objective is to identify a minimal set of conditions 
which are satisfied and which may be used to effect the 
deletion of the action tested. It is clear, from the 
deletion conditions, that fan-out capabilities may affect 
deletion candidacy. Specifically, without fan-out 
capabilities, the parallel deletion candidacy of an action 
requires the deletion of the sink unit definition at the 
action to which it is paraliel; exception: A redundancy case 
where Sink units are the same. In order to accommodate this, 
deletion candidates will be determined with and without fan- 


out capabilities. 


It is important to note that conditions are to be 
applied in the order of their appearance in Section 5.3); 
furthermore, once a minimal set of conditions is satisfied, 
the analysis is terminated, and the action under examination 


becomes a deletion candidate. Of course, More than one 
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minimal set of conditions may be satisfied; i.e., different 


deletion strategies may exist for an action. 


Fan-out capabilities are assumed to be possible at each 
parallel action. Deletion candidates are described as to 
type (negated-forward, parallei-forward, and parallel- 
backward), and the minimal set of conditions satisfied is 
described. Only conditions which require a specific action 
form are described. (acronym "wrt" is used for "with respect 


t Oo it ) 
Negated-forward candidates for deletion: 
ction Conditions 


MO (6) MO (Nt 1) eNO alternate action form 
conditions are required; unit {5} is 
not used aS a source unit in 


successor actions. 


MO (2) MO (N41) 1. Unit {6} may be “eplaced at 
MO(3) by unit {4}, because (C,4,0) 
is feasible. 

2. Unit {6} may be replaced at 
MO(7) by unit {4}, because (C,4,0) 


is feasible. 
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MO (3) 


MO(7) 


MO (N+1) 


MO (N+1) 


ToqeUnat .{7}* may be replaced “at 
MO(G) by unit {5}, because (A,5,3,6) 
is feasible. 

2« Unit {7} maye be éreplacedIeat 
NO(8) by unit {5}, because (A,5,3,6) 


is feasible. 


ibe No alternate action form 
conditions are reguired; unit {9} is 
not used as a source unit in 


Successor actions. 


Parallel-forward candidates for deletion: 


MO (5) 


MO (7) 


Conditions 


1. coUnit {7}! may be. oneplaced? cat 
MO(6) by unit {5}, because (A,5,3,6) 


is feasible. 


io SPNO alternate action form 
conditions are required; unit {8} is 
not used aS a source unit in M0O(6) 


through MO(7). 
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Parallel-backward candidates for deletion: 


Action wrt Action Conditions 


MO (6) MO (3) i= | nO alternate action form 
conditions are reguired; unit {5} is 
hot used aS a source unit in M0O(3) 


through MO(6). 


MO(7) MO (5) te “No alternate action form 
conditions are reguired; unit {9} is 
not used aS a source unit in M0O(6) 


through MO(7). 


The set of deletion candidates have been determined for 
the case where fan-out is possible. The next section 
describes the deletion candidates and conditions when fan- 


out is not possible. 


The minimal set of conditions for negated-forward 
candidates for deletion are the same as those where fan-out 
is possible. However, the parallel candidates are affected 


by fan-out capabilities, in this example. 
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Parallel-forward candidates for deletion: 


Action wrt Action Conditions 


——<— SS ————_ = —_— a 
SS =e SS — 


MO (3) MO (6) 1. Unit {7} may be replaced at 
MO(6) by unit {53}, because (A,5,3,56) 
is feasible. 

2. MO(6) definition of unit {5} may 
be parallel-backward with respect to 


MO (3) (note order-isomorphism). 


MO (5) MO(7) 1. %MO(7) definition of unit {9} may 
be negated forward with respect to 


MO(N+1). 
Parallel-backward candidates for deletion: 
Action wrt Action Conditions 


MO (6) MO (3) 1. MO(3) definition of unit {7} may 
be negated-forward with respect to 


MO(N+1) . 


The example has been presented to demonstrate the 
application of deletion conditions. It is by no means 
supportive of the condition sufficiencies. Sufficiency 


proofs have been offered in Section 5.3. 


_ 


te susie Oi in sae @ (ie 
pel Ap ee ee Onen 
‘did ‘ambdban oy ail | 

yon. fr sig ea edna otek a 
od Bry heed) arte San erae ~he Lian (i ot - | 
. (exLéyaaereloaelzo mio teniee: _ 


nett 2260 Ty mematton (in vt aOR 
6: Fougeem:.. daa bxamaed (b-0quel =m 9 4 


BL aat stall 


revhseies ‘i 


126 it Sina in eadsin: 200 HEF: 
> Seg) APNE (DEP OI“) Hepes 


5-6 Summary 


This Chapter has been presented to define those 
conditions which are required for the determination of 
deletion candidates. The general theoretical approach has 
been to define conditions which are necessary to effect 
changes in UA states. These new states are intended to 
conform to the nonessential action definitions given in 


Séction 3.3% 


An important concept which is presented in this Chapter 
is reciprocal dependency. A theorem has been posed which may 
be used in the analysis of order-isomorphic deletion 


conditions. 


With reference to the solution approach described in 
Section 3.3, Chapter 4 presented an algorithm for the 
identification of all £Orns, F(m), of action (m), 
m=1,2,...,N. Through an analysis of F(m), this Chapter has 
defined the conditions necessary to identify all deletion 
candidates, D. It is now necessary to describe the deletion 
strategies which may be employed in the deletion of actions 
represented in D. Chapter 6 describes deletion strategies 


andlstratégy. \conflicts, so thatecriteria of optimaitaction 


deletion may be defined. 
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CHAPTER VI 


ANALYSIS OF DELETION CANDIDATES 


Chapter 4 showed that ali possible alternate action 
forms can be identified. This alternate action form 
information is used, in Chapter 5, to develop conditions 
required for an action to be a nonessential candidate for 
deletion. The purpose of this Chapter is to describe the 


analysis of deletion candidates. 


As described previously, the deletion candidates are 
identified independently. in order to effect action 
deletions, it is necessary to evaluate the dependencies 
among deletion candidates. The objective of deletion 
candidate analysis is to determine the greatest number of 
actions which can be deleted from a program node. 
Section 6.2 describes the variables of deletion candidate 


analysis. 


In a broad perspective, deletion strategies are 
identified for each deletion candidate. Then, through an 
analysis of deletion strategies and strategy conflicts, an 
optimal solution is determined. Section 6.3 describes the 
strategies and conflicts, and Section 6.4 presentS a means 


for their representation. 
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The graph-theoretic approach given in Chapter 3 
describes the combinatoric nature of this particular 
optimization problem. Simply stated, analysis is performed 
to find the combination of deletion strategies which results 
in the greatest number of action deletions. Section 6.5 
discusses criteria of optimal strategy selection and 
presents a strategy selection approach. The approach 
presented is not designed for its efficiency in terms of 
order of computation; however, this is not the purpose of 
this study. This study is designed to show that alternate 
action forms are determinable; and, deletion strategies may 


be defined which include an analysis of these forms. 


The purpose of this section is to describe the general 
approach used to evaluate deletion candidates. The analysis 
of deletion candidates involves an evaluation of the 


following three variables. 


1. Deletion strategies: Those strategies which may be 


employed in the deletion of an action. 


2. Strategy locations: Those actions in the program 
node where a strategy must be applied, in order to delete an 


action. 
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3. Strategy conflicts: Those Strategies which cannot 
be used together for the deletion of an action or a set of 


actions. 


As described in Chapter 5, an action is a deletion 
candidate if, and only if, a minimal set of deletion 
conditions is satisfied. Alternate source unit assignment 
feasibility conditions are included in the set of deletion 


conditions. 


For the purpose of deletion analysis, deletion 
conditions are placed in two broad categories: Constraint 
conditions and action form related conditions. Constraint 
conditions describe an equivalence Or positional 
relationship which is basic to an action's deletion 
candidacy or to a unit's assignment feasibility; e.g., 
MO(i) <=> MO(j) and i < j. Clearly, no strategy is implied 


by constraint conditions. 


Action form related conditions reguire the existence of 
a specific action form; e.g., alternate source unit 
assignment or negation of some action. Deletion strategies 
are determined from an examination of action form related 
conditions. More precisely, deletion strategies are action 


form related conditions. 


In order to use the deletion strategy variable in 
candidate analysis, it is necessary to identify where, in 


the program node, these strategies are to be applied; i.e., 
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the strategy locations. Also, it is necessary to determine 
the strategies available for application at these different 
locations. For example, assume that a prime source unit 
reference must be changed at MO(x), in order to delete 
MO(y)- The prime source unit field of MO(x) is a location 
which requires some strategy. The Strategies available at 
this location may, for instance, be source unit replacement 


or the negated-forward deletion of MO(x). 


The identification of the deletion strategies at 
Strategy locations is sufficient to determine the ways in 
which a candidate may be deleted. The third variable of 
analysis, strategy conflicts, is necessary to evaluate which 


set of actions may be deleted from the program node. 


In summary, the general approach used to evaluate 
deletion candidates is to first, identify all possible 
deletion strategies; second, determine at which locations 
strategies are required; and third, evaluate conflicting 
strategies. The next section presents a detailed description 


of the three analysis variables. 


The variables of deletion candidate analysis are 
strategies, strategy locations, and strategy contiicts. This 
section describes these variables and how they are 
determined. The next major section presents a means for the 


representation of these variables. 
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Deletion strategies are described in terms of condition 
options which may exist for the deletion of an action. 
Strategy locations (points) are described with reference to 
each type of deletion condition. Strategy conflicts are 
explained in terms of strategy options which may not be used 


together, to delete an action or set of actions. 


—— 


Strategies which may be employed in the deletion of an 
action are determined from an examination of action form 
related deletion conditions. These strategies are divided 
into two general classes: Those which involve the 
replacement of prime source units; and, those which involve 
the deletion of an action. These two classes are the options 
available for the satisfaction of nonconstraint type 
deletion conditions. The second class, action deletion, may 
be performed by the application of five possibie strategies. 
Thus, there are six strategy options available for the 


deletion of an action. These options are described below. 


1. Source unit replacement (SRi): Those strategies 
which require the assignment of a feasible alternate source 
unit as a prime source unit; i is the source unit referenced 


(e.g., i = 2 means replacement of the second source unit). 


2. Negated-forward deletion (NF): Those strategies 


which require the negated-forward deletion of an action. 
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3. Parallel-forward deletion with fan-out (PFF): Those 
strategies which require the parallel-forward deletion of an 


action, where sink unit fan-out is performed. 


4. Paralleli-backward deletion with fan-out ({PBF): Those 
strategies which require the parallel-backward deletion of 


an action, where sink unit fan-out is performed. 


5. Paraliel-forward deletion without fan-out (PFR): 
Those strategies which require the parallel-forward deletion 
of an action, where nontrivial sink unit replacement is 


pertormed. (recall, sk{i) = sk{j) is the trivial case) 


6. Parallel-backward deletion without fan-out (PBR): 
Those strategies which require the parallel-backward 
deletion of an action, where nontrivial sink unit 


replacement is performed. 


The deletion of an action may be described compietely in 
terms of these strategy options. Specifically, the deletion 
of an action may be expressed as a combination of source 
unit replacements, negated actions, and parailel actions. 
From this, it can be seen that the set of deletion 
candidates together with all feasible alternate source units 


completely define the set of deletion strategy options. 


The deletion of an action is nonunique, as there may 
exist more than one set of strategies for its deletion. 
Therefore, it is necessary to identify all possible sets of 


strategies which may be employed in the deletion of an 
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action. This is accomplished by the identification of 
actions where strategies are to be applied; then, all 
strategy options are determined. The next section discusses 
the identification of actions (strategy points) where 
strategies are to be applied; and, describes the 


determination of strategy options at these strategy points. 


Many strategy sets may exist for the deletion of an 
action. The concept of strategy points is helpful in the 
identification of these sets. This section describes the 
strategy point variable of analysis. Also, a theorem is 
presented which is basic to the representation and analysis 
of deletion strategies. First, strategy point is formaily 


defined. 


def. 6.1 Strategy point: Any action's source or sink unit 
which requires aliteration (replacement, deletion, or 


movement), in order to delete an action as nonessential. 


Strategy points are referenced by MO(i.j), where iis 
the index of the action which contains a strategy point, and 
j is an identifier of the unit within MO(i) which requires 
alterations To clarity, if nsc (aj, 1) = 2 and msk.{i) = Ty thes 
MO(i.2) references the second source unit, and MO(i. 3) 
references the sink unit. AS a more descriptive example, the 
following program node is presented (note that at MO(4), 


units {3}, {4}, and {5} are equivalent). 
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Biays ADD {2; 3} {5} 
I(2): GATE {5} {4} 
Liye ADD {2; 3} {3} 
(4): SUB {3; 6} {7} 
T(5): ADD {6; 7} {3} 


If MO(3) is to be a negated-forward deletion candidate with 
respect to MO(5), it can be seen that the MO(4) reference to 
prime source unit {3} must be replaced or deleted. Thus, 
MO (4.1) is a strategy point for the negated-forward deletion 
of MO(3) with respect to MO(5). It is necessary to determine 
all strategy options which may be employed at this strategy 
point. If unit {7} is busy on exit (MO(4.1) cannot be 
deleted), then the following strategies are available for 
the repiacement of unit {3} at strategy point MO(4): 
Replacement by unit {5}, because (C,5,1) is feasible; and, 


replacement by unit {4}, because (C,4,2) is feasible. 


The conditions described in Chapter 5 are applied to 
each strategy point, in order to identify all strategy 
options. Based upon those action form related conditions, 
strategy points are now described for negated-forward, 
parallel-forward, and parallel-backward candidates Hoc 


deletion. 
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ESE MO(i) is examined for negated-forward deletion 
candidacy with respect to fo), then Le oy and 
sk(i) = sk(j). The strategy points for its deletion are 
given by the following: 

Poem 1 ee ato ee 


where UA(m,sk(i)) = 1]3. 


If MO(i) is examined for parallel-forward deletion 
candidacy with respect to MO (4) 7 them tpe<igane aud 
MO(i) <=> MO(j). The strategy points for its deletion are 
given by the following: {assume MO(j.3) references sink 
unit) 

SOnem=a1Ltt i426 << 7) 


where UA (m,Sk(i)) 
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=_ 
Ww 
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Porta it lt 27 ess, jo 1 
where UA(m,sSk(i)) = 2]3; and, 
if fan-out is not possible at MO(j), 


NO (7-3) - 


If MO(j) is examined for parallel-backward deletion 
candidacy with respect to MO(i), then a2 < j, and 
MO(j) <=> MO(i). The strategy points for its deletion are 
given by the following [if sk(j) = sk(i), then a = k of 
first UA (kysk(4)) > 1 (k=it1,it2,--.6,j); if sk(5) # SK(A), 
then a = ij: 

For m=at+1,at2,-.--,¢5 


where UA(m,Sk(j)) = 113; 
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fOr, NHdt17142 32 e5554 
Where UA(m,sk(j)) = 2]3; and, 
if fan-out is not possible at MO(i), 


MO(i.3). 


Ttenes clearltibataiteali Strategy options are determined 
for each strategy point, then all possible combinations of 
strategies are identified for each deletion candidate. The 
representation and analysis of strategy combinations is 
facilitated by the identification of unigue strategies. The 
following original theorem is presented as a basis upon 
which strategy combinations may be represented and 


evaluated. 


Theotem 6.1 Given i equivalent actions in the program node 
PN, j feasible alternate source units over PN, andk 
redefined sink units in PN (units not busy on exit are 
designated as unit definitions); then, the number of unigue 


strategies available for nonessential action deletion over 


PN is bounded above by 21 + j + k. 


FEoODs 

The deletion conditions which relate to action forms are 
forward parallelism and negation, backward parallelism, and 
alternate source unit assignment. From these deletion 
conditions, Theorem 6.1 is immediate; since equivalent 
actions define parallel actions, and redefinitions include 
the busy on exit Status of memory Sub-unitss, Finally, Lt 


fan-out is possible at each equivalent action, then each 
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parallel candidate for deletion may be effected by sink unit 
fan-out or by sink unit replacement (2i). 


Q.E.D. 


Corollary 6.1 From Theorem 6.1 and Theorem 5.2: 
The number of deletion strategy combinations is finite, and 


deletion strategies are convergent. 


Two of the three variables of analysis, deletion 
strategies and strategy points, have been described. Also, 
their determination and relationship have been discussed. 
Finally, deletion strategies have been shown to be finite 
and convergent. The analysis of deletion candidates is 
described in terms of strategy options available at strategy 
points. The third variable reguired for the analysis is 
Strategy conflicts. Strategy conflicts provide a means of 


evaluating the relationship between deletion candidates. 


The identification of strategy options at strategy 
points provides a means for determining ways in which a 
deletion candidate may be deleted from the program node. Of 
course, in order to delete the maximum number of candidates, 
there must exist some means whereby sets of deletable 
actions may be identified and evaluated; i.e., examination 
is given to determine which set of candidates may be deleted 
from the program node. Relationships among deletion 


candidates are evaluated by strategy conflict analysis. 
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For the purpose of analysis, strategy conflicts are 
divided into two categories: Absolute strategy conflicts and 
deletion impedance conflicts. The following definitions are 
presented to qualify these categories; then, an example of 


each category is presented. 


def. 6.2 Absolute strategy conflicts: Those strategy pairs 
which imply different deletion approaches of the same unit 
definition (null deletion approach included); and, those 
strategies which describe different alternate source unit 


asSignments for the same prime source unit of an action. 


def. 6.3 Deletion impedance conflicts: Those strategy pairs 
in Which one strategy results in the null form of MO(i), 
while the other strategy requires the nonnull form of Cp(i), 


SCG@2))i,7 aud c(h), Si<N, 18jsasc (i) 


As an example of the first category, absolute strategy 
eGnkivets,. consider’ the “followings %case:)" (Cc, 473) Wis a 
feasible alternate source unit at MO(8), and MO(3) is a 
negated-forward candidate for deletion with respect to 
MO(9). Two strategies are implied by this case: 1. The 
negated-forward deletion of the MO(3) definition of unit 
{43}; and, 2. The assignment of unit {4}, defined at MO(3), 
asiasprime (sources ainit. Strategy Wo regdires «a negated- 
forward deletion approach for the unit {4} definition at 
MO(3), while Strategy 2 requires a null deletion approach 
for the unit {4} definition at MO(3); therefore, from def 


6.2, Strategy 1 and Strategy 2 are absolute conflicts. 
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AS an example of the second category, deletion impedance 
conflicts, consider the following case: M™O(3) is a negated- 
forward candidate for deletion with respect to MO(9), and 
MO(8) is a parallel-backward candidate for deletion with 
respect to MO(3). Two strategies are implied by this case: 
1. The negated-forward deletion of MO(3); and, 2. The 
parallel-backward deletion of MO(8). The first strategy 
results in the null form of MO(3), while the second strategy 
requires the exastenece sol, ooptey pesc(aedy4. and ¢(3), 
1sjsnsc(3). Therefore, from def 6.3, Strategy 2 impedes the 
action deletion described by Strategy 1. It should be noted 
that deletion impedance conflicts may exist in the optimal 
strategy selection. However, absolute strategy conflicts may 


never exist in any selected strategy set. 


It is clear from the conflict definitions that if all 
strategy options are identified, then all conflicts may be 
determined. Strategy options, strategy points, and strategy 
conflicts have been discussed. It is intended that these 
three variables of analysis be applied, in order to 
determine a set of strategies which will optimize the 
objective function. The optimal strategy selection criteria 
are discussed in Section 6.5. However, before strategy 
selection is examined, a means of representing the analysis 


variables is presented. 
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The analysis of deletion candidates is facilitated by 
the representation of strategy options, strategy points, and 
strategy conflicts. The proposed scheme is developed as a 
means of concisely representing the following relationships 


among analysis variables. 


1. Different action deletion approaches; i.e., the 


different ways in which an action may be deleted. 


2. Strategy points defined for different action 


deletion approaches. 
3. Absolute strategy conflicts among strategy options. 
4. Deletion impedance conflicts among strategy options. 


5. Strategy options available for the alteration of 


strategy points. 


In a broad perspective, these relationships are to be 
represented in a form which will lend itself to deletion 
candidate analysis. Specifically, the representation form 
should contain all the information necessary to select an 
optimal strategy set. The selection of strategies is 
deferred to Section 6.5. The representation form, strategy 


table (ST), is described next. 
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All of the analysis relationships may be represented 
independently, in tabular form. This section describes the 
Strategy table (ST) whose purpose is the representation of 
these relationships. First, the structure of the strategy 


table is described; then, an example is presented. 


6.4.1.1 $s" Structure 


It is clear that all deletion candidates are represented 
in the set of all possible strategy options. Furthermore, 
deletable actions (deletion candidates) and feasible 
alternate source unit assignments make up the set of all 
possible strategy options for a program node. The ST is 
structured to represent all strategy options and strategy 
points. Therefore, all deletion candidates, feasible 
alternate source unit asSignments, and locations which 


require a strategy are represented. 


Strategy points are identified by an examination of the 
UA and the ALT. Strategy options which may be applied to 
these points are determined by the application of action 
form related deletion conditions. These conditions include 


alternate source unit assignment feasibility conditions. 


To describe the structure, the following notation is 


presented. 


nD elements in the set of deletion candidates D. 
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d(i) unique strategy points defined for the deletion of 
the action represented in D(i), 1<i<nD. 

nF elements in the set of unique feasible alternate 
source units F, [F is determined by those possible 
assignments at d(i) ]. 

£(q) unique strategy points defined for the feasibility 
of the alternate source unit represented in F(j), 
AS jSur 


Using this notation, the ST is defined to be a K by K matrix 


where 
nD nF 
K = nD + nF + fF d (i) + > ECA ye. 
i=1 j=1 


From the above equation, it can be seen that the ST is 
structured to represent the different deletion candidacy 
options available for an action's deletion (nD); the 
different feasible alternate source units (nF); the strategy 
points for an action's deletion {d(i)]; and, the strategy 
points for a unit's alternate source assignment feasibility 
{£(j) ]. Thus, all possible strategies and strategy points 


are represented in the Sf. 


The strategy table is divided into three sub-matrices. 
Each sub-matrix is designed to represent one or more of the 
relationships described previously. The purpose, 
organization, and elements of each sub-matrix are now 


described. 
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1. Strategy conflicts sub-matrix (SC): The purpose of 


this sub-matrix is to represent the conflict relationships 


among strategy options; also, by its organization, the SC 


represents the different action deletion approaches. Using 


the notation described in this section, the SC is an 


(nDtnF) by (nDtnF) matrix. ST(i,j) is resident in sc, if 


1 = 2S (nDtnF), and 1s j < (nD+tnF). Furthermore, the SC is 


organized to represent deietion candidates in ascending 


action index sequence, with all action deletion 


approaches 


grouped by action. Sf(1,1) through ST(nD,nD) represent 


deletion candidates. ST(nD+1,nD+1) through ST(nDtnF,nD+tnF) 


represent alternate source unit assignments. Therefore, the 


SC represents all possible strategy options. Each eiement in 


the SC sub-matrix assumes one of the following values: 


SC(i,j) = 0, if strategy option i does not conflict with 
strategy option j. 

SG(ipypecet;, it «strategy  ceptiongyi, prsetan absolute 
strategy conflict with respect to strategy j- 

SGli,jjo=' 2,9 ifs strategy option’ sisimpedes thevaction 
deletion described by strategy option j. 

SCUli,3)) =03, if strategy option®,j «impedes the “saction 
deletion described by strategy option i. 

SC (i,j) = 4, if strategy option i impedes the action 
deletion described by strategy option a and 
strategy option j impedes the action deletion 


described by strategy option i. 


Since no action deletion is identified by feasible alternate 
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source unit assignments, SC(iF4) = ofizZ for 
nD < i < (nD+tnF) ane ket, SS nD; Sci’. 4) = O(n for 
isa = 98D and nb <4 < (nDtnr); and, SC (aj) = Olt (fon 


nO <= (nDtnF) and np < 4 < (nD+tnF). 


Z2eeeStrategy”™=points for Strategy options sub-matrix 
(PO): The purpose of this sub-matrix is to represent those 
strategy points which require alteration, in order for a 
strategy option to exist. The PO is a (nD+tnF) by (K-(nD+nF) ) 
matrix. ST(i,j) is resident in PO, if 1< i < (nDtnF), and 
(nD+tnF) < j < K. Each element in the PO sub-matrix assumes 
one of the follcwing values: 

PO(i,j) = 0, if strategy option i does not require 

alteration of strategy point j. 
PO(i,j) = 1, if strategy option i requires alteration of 


strategy point j. 


3. Strategy options at strategy points sub-matrix 
(SP): The purpose of this sub-matrix is to represent those 
strategy options available for the alteration of a strategy 
point. The SP is a (K-(nDtnF)) by (nDtnF) matrix. Sii(1 je. 
resident in SP, if (nDtnF) <i< K, and 1S j S$ (nDtnF). 
Each element in the SP sub-matrix assumes one of the 
following values: 

SP(i,j) = 0, if strategy option j may not be used to 


alter strategy point i. 
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SP(i,j) = 1, if strategy option j may be used to alter 


strategy point i. 


It is noted that although the st is structured as a 
K by K array, ST (nD+taF+1,nD+nF+1) through ST(K,K) are unused 
elements. A review of the ST structure shows that all 
analysis relationships are represented. The next section 
develops the representation of the program node given in 


Figure 3.1 into a strategy table forn. 
6.4.1.2 ST Example 


Chapter 5 showed the deletion candidates for the program 
node given in Figure 3.1. Therefore, the deletion candidates 
subset of strategy options has been identified. The 
successive application of the conditions described in 
Chapter 5 will yield the remaining strategy options; namely, 
the feasible alternate source unit assignments. The strategy 
options and related strategy points are given in Table 6.1 
(accronyn "wrt" is used for “with respect to"). The ST ID 
entries in Table 6.1 and Table 6.2 are reference numbers 
which correspond to entries in the strategy table. For 


generality, fan-out is assumed to be possible at all 


equivalent actions. 
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Table 6.1 Strategy Options and Strategy Points for Program 
Node Given in Figure 3.1 
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The application of deletion and alternate source unit 
feasibility conditions to the strategy points given in 
Table 6.1 yields the strategy options which may be employed 
in the alteration of strategy points. Table 6.2 shows the 
strategy options at strategy points. The information Shown 
in Tables 6.1 and 6.2 is used in the construction of the 


strategy table given in Table 6.3. 
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fable 6.2 Strategy Options» at Strategy Points for 
Program Node Given in Figure 3.1 
ae 
LSS Gee 8 TEE Gry O°? VE TE WAN es 71 
Sameera a 7 ee iS 8o SS eS ie 7 
{Sst |- Strategy -{ Action or (Strategy| wot {-- 524 | 
Pros | Points | ALT (asc) {7Option | Action { vefs.]| 
fst St =A 
(187s MO 63 22) fe (Ce4 70) [OgSR2 } M™0(3) ie 4270 
! | i | | | i 
1 ial be LY { MO(3) { NF | MO (N+1) | Zatt 
| { l | | i l 
LaSel na FX | MO(3) rier { MO(6) i 30] 
{ | 1 i { i { 
ide a “ " j MO(3) { PFR | MO (6) | 4 | 
lf: oor Got ate et eS ee 
119 | MmOo(3.3) { MO(3) | NF { MO(N+t1) i 2041 
| | | i | | | 
(ete oan | MO (3) PeeErs | M™o(6) | Su ii 
| i I { | | i 
ae pgs { MO(3) PoP PPR | MO(6) i 74] 
ene Se aan 6 pao &. 
120 | MO(6.1) DECARS 73 SOG leesrt 1 MO(6) be wei 
| l | j | { 
be el 94% | MO (6) ieene J MO(N+1) | Ley 
i { i { ! { ( 
FEE! ‘wige | HO (6) PORRse { MO (3) | I i 
| { | | l i | 
Pavel et { MO(6) { PBR | M™o(3) i I} 
ee ee a et 
2041 MO (6.3) | MO(6) | NF { MO (N+1) | Ott 
i i { | | 1 
Petey wey’ | MO (6) | PBF | MO (3) | 8 ] 
| { i I | | j 
betes "oo | MO (6) | PBR { MOo(3) { Seri 
ay fee ee er 
i224 he MO{7S2) $9074 A0) { SR2 | MO(7) (a 413 | 
i i i | { { 
i" | non { MO(7) | ONE {| MO(N+1) to Wit 
| ! i { { I i 
pa “oom } MO(7) [.sebF ) Pp ene) f at | 
$$$} —— | st 
{23 | MO(7.3) { MO(7) | NE {} MO(N+1) as | 
i | { 1 { { 
ot ny nom {| MO(7) [ecPBEF |) f{ GMO(5) a 
{+t — | SG Se = ger 735s aa im 4 
124 | MO(8.1) A wAsias ay o[ DeSean {| MO (8) ; -45 | 
{ | l 
: a me ke ( (po73 75) 81 7581 } MO(8) 1 116 | 
{ { | { i 
" , non () AC#5 26) { ESR f} “NO (Ss) ea | 


a 


mS ae 


we “Se ‘ aes ee 


7 5 a > 
\ : i tye of § +t / ; ae 5i TS. an be 
- { ney Oe see tage | Teesyess a5 ft aoe st 
——— = eee te et es oe 
j iv \lis | f { rete ; ASSEN Nd 
i i i t ay. 
; i (Fer pus ft mo) pow 1 . et a v 
j u ; i . 
} : (?} aa ; j t£ yon a3 nn 
; ' , .. Seon. Th Sar eoG 
a wig ; c j {cj 0e>4 2-4 aah : 
SS eS a a ee : 
» irae (498 -f— (4.608 ene. 
: } | a: 
. nn ‘ ime ‘gipames ee | +. 
i : ; } J Pol 
i ‘ or 4 i (49Ge@ j ee ok | 
6. ee |) Se es ee es tins Se eee See = ae = sr ~ 
: ' f 1 ge bytyab -& 
i 4 j ' 4 
e! rae ae | ei 
t ‘ i i] 
\ "ou fos (9¢C8 4 a 
' : 
1 f i’ m { (ajve f - 
pe ee em eS 
i ‘ y , fF aft j f (ape - + (6.4) 0® 
@ | | : o 
‘ iL; | ean) f (ehur *. @i 
i ' i of 
\ \ >Hi i i i ( * o ap" 
pa pe a ee samen 2 
hb. el “al (Fi ej i st pom 14 


| : 4 aA - 
ay be 7 
e * Lea 


Table 6.3 
peeere 
is f RA TE GY 
t 


{| deletion candidates {| 


i 
| | | | 
| { i {1 
hTp2 SaAslSte. ph 8u9 [0 
Se SS Soa Rees Geennaeoeecr listnacmer Gocsesteceeenns 6 
bo he y2 
i |d-—-+-+_——__+}_+_- 
itte(S 21/010 1 170 O70 3 31/0 0(3 
1 (li iol i | | { 
HRilel3 34031 0 140 OF2 4 440 043 
pel | { i | { 
Al{il3 41041 1 OJO0 OF2 4 4YO 073 
(of eee 
Dimlse 15 (OROC OR OROL [080 a 410 
ial | 
Eicts 
ae 
GIatew “f(0 (0393 1080 Ort 11 (O70 FO 
id| | | j { | 
VL [6 Sepfopz §24)0. 0) TeOetf0e0(0 
jd] i | j | | 
jafj6 91,012 4 410 011 1 OFO OO 
Ue 
{el7 1047070 O OF 3 340 0 OFO 140 


17 11400 0 O74 440 0 OF 1 OFOD 


0 


0 


0 


0 


3 


3 


=) 


| i l i 
6/0(0 0 O[17 010 0 042 410 00 0 0 Oj0 


El 


OP Ff 2 ONS! 
ALT unit | 


ae 
1/010 0 040 OYO O OfO 010.0 0 0 O OF1 


t 32.0 [8 


| 
oO 3-076 
0 3 010 
0 0 040 


mies Fea ack ek ae 


$2) 0 252 
jAs 1 | d | | 
131010 0 010 0 042 240 
iTi i] i 1 | i 
THfO[222 011 0 O10 O40 

i | i 
O[2 2 250 018 
jie i | i { i { 
0417 0 040 040 
i t 1 4 | l | | 


| 
{ 
i 
l 
i 
| 
i 
| 
l 
{ 
i 
{ 
| 
1 
1 is] | | i i l 
i 
i 
| 
i 
| 
i 
| 
i 
| 
{ 
| 
i 041 0 140 040 


2h Or OOS OR a 0s. 02 141 
25 (0UNG20S0.0 CO Ge08O 11 


G20; O00 010 


0 


0 


Coogee O20 CTO] Cad 


0 


0 0 010 


| 
0 0 040 

] 
0 0 140 


(eS) 
(2) 
So 


(>) 
i>) 
>) 


Se 
Sy RS) =) 
oS a C 


— 
— 
— 


POINTS 


Strategy Table for Program Node in Figure 3.1 
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The variables of analysis were described in Section 6.3. 
The representation of these variables, along with an 
example, has been presented in this section. In the next 
section, these variables are used to show that the selection 


of an optimal strategy set is possible. 


6.5 Optimal Strategy Selection Criteria 


Alternate and equivalent action forms are identified by 
a deterministic algorithm. These action forms are used in 
the. definition of a minimal set of deletion conditions. The 
deletion conditions are applied to actions in a _  progran 
node, in order to determine those actions which are 
nonessential candidates for deletion. Examination is given 
to these deletion candidates, to determine ali strategies 
which may be employed in their deletion. The inciusion of 
alternate action forms in the deletion of nonessential 
actions requires one additional step: The selection of a 
set of strategies which results in the maximum number of 


deleted actions. 


From the previous sections in this Chapter, it is clear 
that all deletion strategies, strategy points, and strategy 
conflicts can be determined. Furthermore, it is asserted 
that the information contained in the strategy table is 
sufficient to identify those strategies which may be 
employed in the deletion of an action or a set of actions. 
The strategy table is used throughout this section to aid in 


the description of a strategy selection approach. 
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First, the determination of an optimal strategy set is 
proved possible; then, an example is presented. The optimal 
strategy selection approach is presented with the objective 
of showing feasibility. The implementation technique used 


for such an approach is not evaluated in this study. 


The objective function may be restated as the selection 
of a set of strategy options which results in the maximum 
number of action deletions. The following original theorems 
establish a basis for the description of a feasible optimal 


strategy selection approach. 


Theorem 6.2 Given a program node PN: Let D be all strategy 
Options which describe action deletions. If all absolute 
strategy conflicts and deletion impedance conflicts are 
identified for strategies in D; then, an optimal solution 
upper bound can be determined, and all strategy option sets 
in D which possibly yield the optimal solution can be 


determined. 


Proof: 


This theorem is immediate, since absolute conflicts identify 
all pairs of strategy options which can be used together; 
and, deletion impedance conflicts identify all pairs of 
strategy options which result in the impedance of either or 
both action deletions described by the pair of strategy 


options. From def 6.2 and 6.3, no strategy option pair in D 
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may represent both an absolute and an impedance conflict; 
and, absolute conflicts can occur in D only for ‘strategy 
options which describe the deletion of the same action. 

If D contains ali strategy options which describe action 
deletions, then all deletion candidates are represented in 
D. Therefore, an exhaustive examination of strategy option 
pairs which are not absolute conflicts will identify all 
possible combinations of unique action deletions from PN. An 
exhaustive examination of impedance conflicts within these 
possible combinations will identify a maximum number of 
action deletions (optimal solution upper bound); and of 
course, all possible combinations (strategy option sets in 
D) which can yield an optimal solution are also identified. 


Ops ESE 


Theorem 6.3 Given a program node PN: Let D be all strategy 
options which describe action deletions; F be ali strategy 
options Which describe feasible alternate source unit 
assignments; and, P be all unique strategy points. With no 
consideration given to F and P: Let G be a set of s strategy 
options, G ¢ D, with no absolute strategy conflicts and with 
t impeded action deletions described in 6G, t < s. The 
following constraints are necessary and sufficient for the 
selection of G', such that (s - t) actions may be deleted 


from PN by the strategy set Gu (Gf G‘): 


7 
= f-% a0 
ea ciel 


St": " 

- | tes scl = 8 
yeevaaic: sol Gaerne ab aaser neo * iow 9 
»wo) Sia orhe nth te antes eb #09 - ats a8 b salAg 

aghige G6 lrea eh ee plier ste! feores tienes 

+ hetiowosyed 4 ees bisne> Ve ng feb: someone 7 
nalign roth Ya, Gistad#ane € wihen pens ae, enon a 

p Tse Ltée erat iteos su koade jou Jae oe ee 
we 9 Vws!s Aen psoleh Chine gp ine Sean inihdecaetma 


> RAR 
- 


a £Qeus 


b w4deie, eral l ead 9eckayal fe. sotten teste ~ evisaraiae 
jHitoel "ep etidarns Lise aies.satdao>, eitiaeeg 
wip 8 . mare i : what 5 wah (GD) savssetob a0aoe 
P vind be pin 9%) Giles IGs oveeD sidianod Ihe e202 


r je Evil pz . — ‘> we —- ot. H ing ‘a ‘bLlelty tis) dataw qe 


Lap ae ‘ Y fa8 an : Gespeow a terke (6.9 (peaeel 
oo Pes 
vVo-~pecta, TS y sancihs¢* uistoe. C4R-RoenFt sulawenosege —- 


#6 
we Wa ote ieanw Ps p6ies@? «+eiegesh® aolide ‘eaOETqO 


oe (ati CIR Ag veces oo pnt fle a Baker ja saeuar — 
yootnsta cstn ler 8 > +t mega aaeks woawe7ah oo 
. ; {a 
Pa ; i404 2a ba - ot iieten diy | | eno qo 
=a : 7 _ 7: 


> 
a @°s aD e ; q [34% ° vu anve wal oa a ba leey, : 


= ——- : ss = a 
@'e403 55905 ve pagar 


+é¢n0 lee 
gh fnew 


145 


ley All. P defined for G and G" are altered by G"; 

2. If strategy option G'(k) in G' is selected to alter 
Strategy point MO(i.j) in P and G'(k) € D; then, the 
action deletion described by G'({k) must not be 
Mupededuhye ea, (6 yeG") for 4) <-nseta a. 

3. No absolute conflicts exist in Gy (G wy G'); 

4. [Deleted actions in GU (G aA G') ] - [impeded 


deletions’ 2n°G u (Gp G') ] 2° (s =°t). 


If G is a set of s strategy options in D, and no absolute 
conflicts occur in G, then G is a set of strategy options 
which describes s unique action deletions. If there are t 
impeded action deletions in G, then (Ss - t) defines the 
number of actions which may be deleted from PN by G. 
Regarding Constraint 1: 

To delete (Ss - t) actions from PN, ali P defined for G must 
be altered; furthermore, all P defined for strategy options 
employed to alter all P for G must be aitered. The 
termination of this recursive condition is assured by the 
convergence of deletion conditions (Theorem 5.2) and 
deletion strategies (Corollary 6.1). This condition is 
defined by Constraint 1. The feasibility of selecting a G' 
which satisfies Constraint 1 is insured by the following: An 
exhaustive examination of deletion conditions for deletion 
candidates identifies all P required by D ¥ F and all DU F 
Which may be employed to aiter each strategy point P(v), 


V=1,27542) 0s Since stor any P(v) € 2, the alteration of P(v) 
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is possible by at least one strategy option in DU F, at 
least one G' exists which satisfies Constraint 1. 

Regarding Constraint 2: 

From an examination of the deletion conditions, it can be 
seen that strategy options are selected to effect one of two 
state changes in the UA: State 1/3 changed to State 0j2; or, 
State .2(3. changed, ta, State 0/1 (see «Section 5.2) 5 "State 
Change 1{3 to 0|2 represents the replacement or deletion of 
a prime source unit reference. State change 2]3 to 0j1 
represents the deletion or movement of a prime sink unit 
reference. If strategy option D(a) € D is selected for state 
Change 2j3 to 0j1 at UA(i,u), and D(a) is impeded by some 
other strategy option; then, only the number of actions 
deleted is changed ee the maintenance of Op(i), sc(i,j), 
and c(i) does not affect the change made to UA(i,u), 
1<j<nsc(i) ]. If strategy option D(b) € D is selected for 
state change 1|3 to 0j{2 at UA(i,u), and D(b) is impeded by 
some other strategy option; then, the number of actions 
deleted is changed, and the state at UA(i,u) is unaltered. 
Clearly, this must be avoided, since the P(v) altered by 
D(b) is no longer altered after D(b) is impeded. The only 
case where P(v) may be altered by D(b) and then unaitered by 
the impedance of D(b) is where P(v) iS a _ source unit 
alteration; since, from def 6.2, no nonconflicting strategy 
options exist which can move a sink unit definition twice, 
or delete a sink unit and then cause it to reappear. 


Constraint 2 is necessary and sufficient for the avoidance 


vial ‘a 
BAD ene: ee aw 
: ie -t vakendei oo | 
: oi. cum asiekh! et herein? O29 44 Sahneaes aon ee 
Dow GP eine tet o8yepol 1 2 aeoseenn sats oe 


_ Ta ee eo ee ee ee Sey ae 
rer » Vee Greek of Gopomips £18 edes8 | 
7 ae : 
it , r 7S 3a yes |S RP Qeoundes 40 et A1f opie y 
1 an ; ened: 00 «68a «66ieu «eve oattay 
? rae j i ; ree @. 20 moiteal ob vita ec 
inte she ried [4 & (el 6 aeige Yoosntss TF -soneaiteea 
= a | we tee . wce~ee aa Fi 02° 8 epasds 
iy : aoe 
faa. OOM) fp AggO ainhetote i7Q a 
» a <a) pm whil te , wey] begendo ak busotab 
‘+. 
Thy Dyas ; Oo 082k D208 . GS0D - (hya oh 
=> > ne 14 od i a of > Lees OTe a? “t (LJORREEES 
es -_—-, 
wt F > es O Se teen ay sekalies esas 
a 
ie. @aey aia’ .aeghqa Yjpeiers. 16110: Sige 
_Y 
-<@ : edi 6 .Y 
é a }' ba 7 i 
hon 
v¢ spee S18) €6 (O18 tar ‘pagaot on ek (da 
_ 
é 3 oat 1 a Pa) - Gas iw) oj Sehe 


avy wep | ae ope =e I sles jo saat 
(ao 9@ see. CELe Suhe! i ea yes heen 
ware? unkbbes aot ode io ioe 


hese tt a 


147 


of altering and then unaltering a P(v) € P. Furthermore, 
Since all impedance relations are known for D U F, the 
application of Constraint 2 is feasible by the exhaustive 
examination of impedance relations in G U (G A G*). 
Regarding Constraint 3: 

The feasibility of Constraint 3 is immediate by exhaustive 
examination of strategy conflicts, since all absolute 
strategy conflicts can be determined for all strategy 
OPLLONsSs DQUERZ fhCenstraints 1,02, sand = "3 sane, therefore, 
necessary and sufficient to insure that G' is selected, such 
that Gy (G 4 Gt) alters all strategy points required by 
GU (GA Gt), and that no absolute conflicts exist in 
Gu (GA G*). 

Regarding Constraint 4: 

Since only strategy options which describe action deletions 
may be impeded, the deleted actions described in 
Gu (Gad G") minus the impeded deletions in Gv (G4 G') is 
the set of action deletions from PN. Therefore, Constraint 4 
is necessary and sufficient to insure (s - t) actions are 
deleted. Constraint 4 is feasible by an exhaustive 
examination of Strategy’ conflicts; /ssince wall *Strategy 
conflicts may be determined for all strategy options in 
UTE 

Constraints 1, 2, and 3 insure that all required strategy 
points are altered. Constraints 2 and 4 insure that no 
absolute conflicts occur in the selection of strategy 


options which result in (s - t) action deietion. Finally, 


etoaaess ‘ oha om’ 
coiei® tse om Henrie ad * ad Ss vEltnee ‘Yeeses 


. 


oxen te® ete (fae 2h) ee Reeipaiage: a a 0 aeedag 


_ 
a 


edn px hebalee 40 cosh as en at eS a 38 - 
-.~. 

er 

vd “Schieges 2iteny Yontadie | eae on ' 

a 7 — 

ab eace, omniiiena siaheme 6f gala: va 


ee, aots {ito 
sé ted @xhew tetriady O68 sei ame * £ (aa 
en 
) raethem arexrt,.~ st seubigiaies y doe 
420 66529 e) a = nV anven wee . a a 3 

erkqceas . a ob aoaliy 2% ib derenraccs shed 


148 


Since all variables of analysis can be determined, the 
implementation of the constraints is feasible. 


O38 2D < 


Corollary 6.3 From Theorems 6.2 and 6.3: 


If M is the optimal solution upper bound, then the existence 


of a strategy option set which results in M action deletions 


can be determined. 


From Corollary 6.3 and the constraints given in 
Theorem 6.3, the following steps are presented as a feasible 
deterministic approach to the selection of an optimal 
strategy set. Notations estabiished in Theorem 6.3 and 


Corollary 6.3 are used in these steps. 


Step 1. From an examination of conflicting D elements 
in the SC sub-matrix, determine M, the optimal solution 


upper bound. Continue with Step 2. 


Step 2. From an examination of conflicting D elements 
in the SC sub-matrix, determine a unigue set G, which 
results in M deletions. If no such set exists, set M=M- 1 


and continue with Step 2; otherwise, continue with Step 3. 
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Step 3. From an examination of required strategy points 
in the PO sub-matrix, the strategy options in the SP sub- 
matrix, and the strategy conflicts in the Sc sub-matrix, 
determine on there eexists? a iG'» suchy >that elthes\fotr 
constraints given in Theorem 6.3 are satisfied. If such a G! 
exists, then G UY (G 4G") is an optimal strategy set 


selection; otherwise, continue with Step 2. 


To demonstrate the approach described, an example is 
given next. The technique used to implement the approach is 


an exhaustive search of the strategy table. 


The steps described in the last section are now applied 
to the strategy table given in Table 6.3. The technigue 
used, exhaustive search, is by no means intended as an 


example of an efficient algorithn. 


An exhaustive search of the upper-left 11 by 11 sub- 
matrix in ST yields M = 4 as the optimal solution upper 
bound. Note that the strategies to delete MO(3) and MO(6) 
are deletion impedance conflicts, except for strategy 
options 2 and 7. Also, all strategies to delete NO(5) and 
MO(7) are deletion impedance conflicts. Thus, either MO(5) 
or MO(7) may be deleted, but not both; and, MO(3) and MO(6) 


may only be deleted if strategy options 2 and 7 are in the 


selected set. 
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Among those possible strategy sets determined in Step 2 
with M = 4, the following set is discussed in terms of 
Step 3 examinations: (1,2,6,7). For the examination of 
(1,2,6,7), begin with strategy option 1, and examine the PO 
Sub-matrix to determine that strategy points 18 and 22 
require alteration. Now, examine the SP sub-matrix to 
determine nonconflicting strategy options for altering 18 
and 22. Strategy point 18 may be altered by strategy 
option 2, with no conflict. Strategy point 22 may be altered 
by strategy 13, with no conflict. tenth examine the PO  sub- 
matrix to determine strategy points for strategy option 2. 
They are 20 and 24. An examination of the SP shows’ that 
strategy point 20 may be altered by strategy option 7, with 
no conflict. Strategy point 24 may be altered by strategy 
option 15, (16, or 17. From Constraint 4 in Theorem 6.3, 
strategy option 15 is not possible for G = ({1,2,6,1), 
because it is an absolute conflict with strategy option 2. 
Also from Constraint 4, strategy options 16 and 17 are not 
possible, because they are absolute strategy conflicts with 
strategy option 7. Therefore, with G = (1,2,6,/), no G' may 


be selected such that G u (G gd G') is optimal. 


An examination of the SC sub-matrix shows that only sets 
which contain strategy options 2 and 7 may result in the 
deletion of M actions. An examination of the PO and SP sub- 
matrices shows that the alteration of strategy point 24 
(ceguired for strategy option 2) always results in a 


conflict with strategy option 7. Therefore, optimal deletion 
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is impossible with strategy options 2 and 7 where M = 4. M 


is decremented by one, and Step 2 is initiated. 


Three of many possible strategy sets G, for M = 3 are 
Clee eh lig (1752S) 7) and (156,9).. Clearly, G =9(1,2.1) e2snuot 
optimal because of the previously described conflict between 


strategy options 2 and 7, 


For G = (1,5,8), required strategy points are 18 and 22. 
By selecting the strategy set G* = (12,13), an optimal 
Strategy set 12s defined by Gu (6 ¢ Gt) = (1,5,8712,13)~ The 
application of this optimal strategy set upon the original 
program node resuits in the optimized program node given in 
Figure 6.1 [note that strategy options 5 and 8 involve fan- 


out at [qspfeand I (7) }. 


IT (1) GATE {2} {3} 
I (3) ADD {334} £7 ¢5} 
T (4) ADD {152} {2} 
E(t} ADD {234} {9; 8} 
I (8) SUB {733} {3} 


Figure 6.1 Optimized Form of Program Node in Figure 3.1, 
With Fan-out 


It is possible to restrict analysis to those strategy 
options which require no fan-out. For this example, the 
strategy set G = (1,6,9) results in an optimal strategy set 
that requires no fan-out strategies. The strategy set 
GY i=" (42, 13,710,2,9,16). 15 selected to alter srrategy points 


18, 22, 23, 19, 20, and 24 respectively. These are all the 
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Strategy points required by G and G'. Thus, the optimal 
strategy set Gu (G A G') = (1,2,6,9,10,12,13,16). Note that 
action deletions described by strategy options 2 and 10 are 
impeded. The application of this optimal strategy set upon 
the original program node results in the program node given 


an Fig@dres6. 2. 


I (1) GATE {2} {3} 
qT (3) ADD {334} {5} 
T (4) ADD {132} {2} 
I:(7) ADD {234} {8} 
I (8) SUB {533} {3} 


Figure 6.2 Optimized Form of Program Node in Figure 3.1, 
With No Fan-out 


In summary of the analysis approach, deletion 
Strategies, strategy points, and strategy conflicts have 
been used to determine a strategy set which results in the 
optimal deletion of actions. It can be concluded that the 
identification of alternate action forms Significantly 
increases the opportunities for nonessentiai action 


deletion. 


Ka “a final point, itcis: interesting | to jmnote “that ..abl 
nonessential action deletions defined by Klier and 
Ramamoorthy [27] may be described in terms of the sf 
structure. That is, those strategy options which require no 
strategy point alteration; are not in conilret sf dayenct 
relate to an analysis of exit status [MO(N+1) ]; and, which 


relate to parallel actions that specified the identical 
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input memory sub-units at the source-level. In Short, 600 
strategies would exist for action deletion in the program 


node given in Figure 3.1. 
6.6 Summary 


Chapter 3 established a framework for the analysis of 
alternate action forms. A notation was also presented for 
the general description of a microprogram. Chapter 4 
presented and proved an aigorithm which identifies all 
alternate and equivaient action forms. Using these alternate 
action forms, Chapter 5 described a minimal set of 
conditions required for the deletion of a nonessential 
action. The application of these conditions to a program 
node yields a set of deletion candidates. This Chapter 
described the determination of all possible deletion 
strategies. Then, by examination of deletion strategies, 
strategy points, and strategy conflicts, the optimal 


deletion of actions was shown to be feasible. 


With reference to the solution approach described in 
SECtION 3.3; Chapter 4 presented an algorithm for the 
identification of all forms F(m), of action(m), m=1,2,...,N. 
Through an analysis of F({(m), Chapter 5 defined the 
conditions necessary to identify all candidates for 
deletion, D. This Chapter described the identification of 
strategies which may be employed in the deletion of actions 
in D. Then, through an analysis of conflicting strategies, 


it was shown that it is possible to select one form, Z(k), 
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for each action, such that the maximum number of actions are 
deleted from the program node, Z(k) € F(m), k=1,2,...,N. The 
next Chapter presents study conclusions and recommendations 


for further research. 
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CHAPTER VII 


SUMMARY, CONCLUSIONS, AND RECOMMENDATIONS 


This study posed and solved three major problems in the 
discipline of Microprogram Optimization. These problems are 
the identification of alternate and equivalent action forms; 
the definition of deletion strategies which include an 
analysis of alternate and equivalent action forms; and, the 
determination of a set of deletion strategies which may be 
employed in the optimal deletion of nonessential actions. 
The solution of these problems results in a Significant 
increase in the opportunities for nonessential action 
deletion; i.e., actions in a program node may be altered to 


force other actions to assume a nonessential status. 


Before the problems were addressed, a notation was 
presented for the generai description of a microprogran. 
This notation was designed to include any encoding structure 
for a micro-instruction and time validity constraints for a 
micro-operation. The results of this study may be applied 
directly to any microprogram expressed in the proposed 
notation. Also, many software programming languages may be 
mapped directly onto the notation for a highly encoded 
monophase micro-instruction. Therefore, the results of this 
study, with qualifications, may be shared by the discipline 


of General Compiler Optimization. 
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In designing the framework for the study, it was found 
that the set of redundant actions is covered by the set of 
parallel actions. This serves to generalize the approach to 
nonessential action deletion by restricting examination to 
actions which reference the same output memory sub-unit and 


to actions which are equivalent. 


To aid in understanding the combinatoric nature of the 
problems addressed, a graph-theoretic description of actions 
tt a program node was presented. However, for the 
identification of alternate and equivalent action forms, a 
graph-theoretic approach was shown to involve needless 
recursive analysis of sub-graphs. Alternate and equivalent 
action forms may be identified by an iterative deterministic 


algorithm. 


The first major problem was solved by the proposal and 
proof of an algorithm which identifies all alternate and 
equivalent action forms. Equivalent actions are identified 
by the application of a hashing function. The identification 
of equivalent actions allows all equivalently defined memory 
sub-units to be identified. The qualification of these units 
by their index of definition established a means of 
identifying all possible inputs and outputs for an action, 
including those action forms which may only exist after 


other actions have been changed or deleted. 
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With respect to order of computation, the algorithn 
requires a maximum of two examinations of each action in the 
program node. The primary limitation of the algorithm is 
that run-time statistics are not used to identify alternate 
action forms. It can be seen, however, that the inclusion of 
run-time statistics would only serve to increase the number 
of equivalent actions identified; the theoretical deletion 
advantage of alternate action identification would be 


unaltered. 


The second major problem was solved by the definition of 
a minimal set of conditions which must be satisfied in order 
for an action to be deleted. These conditions are applied to 
the actions in a program node to determine those actions 
which may participate as candidates for deletion. By 
successive examination of these deletion conditions with 
respect to each deletion candidate, it was shown that all 
possible deletion strategies may be identified. Furthermore, 
all locations in the program node are identified, where 


these strategies may be applied. 


A significant problem encountered in the definition of 
deletion conditions was that action deletions may have an 
order-isomorphic relationship to each other. Because these 
cases are identifiable in the application of deletion 
conditions, it was possible to formulate a means of avoiding 
order-isomorphisms. Thus, the convergence of deletion 


conditions and deletion strategies is assured. 
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A basis for the solution of the third major problem was 
eStablished by the definition of strategy conurili crs. 
Strategy conflicts were used to prove the existence and 
identification of an optimal solution upper bound; i.e., the 
maximum number of actions which may be deleted from the 
program node. Strategy conflicts were also used to prove 
that the existence of an optimal strategy set may be 
determined. Finaldy; a deterministic optimai strategy 


selection approach was described. 


The major limitations upon the analysis of deletion 
candidates are that branch-points are not included, and 
program region analysis is not performed. The resumption of 
this study without these limitations provides a possibility 
for future research. Global optimization procedures which 
have been applied to software languages may also be 
applicable to microprogramming languages. For example, it 
may be possible to identify program node equivalencies; 
thereby, deleting an entire node. In a broad perspective, 
the results of this study would be based upon an analysis of 
regional properties and units which are busy on entry into 
each program node. The branch-point limitation presents a 
special problem because of the varied schemes used to select 


the next micro-instruction. 


The implementation of the optimal strategy selection 
approach provides numerous opportunities for future 


research. From the standpoint of feasibility, the approach 
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described is acceptable; however, there may exist better 
approaches, from the standpoint of implementation. For 
example, the strategy selection problem may be examined for 
conformity to a graph coloration problem (possibly convert 
Strategy conflicts into preassignment constraints). It may 
also be possible to assign weights to analysis variables and 
approach the problem in terms of linear programming or game 
theory. Finally, the problem may conform to a_ scheduling 
problem where origins are units and destinations are 


actions. 


The results of this study clearly establish a new 
criterion for the evaluation of program fitness: The 
relationship of equivalent actions and equivalently defined 
memory sub-units to program efficiency. It is believed, 
although not proved, that a relationship exists between 
unnecessary actions and the number of actions over which 


equivalently defined memory sub-units exist. 


As a finai recommendation, this study may have 
applications in the general area of hardware evaluation. 
Simply stated, the action deletion advantage gained by fan- 
out may be compared to the action deletions where fan-out is 
not possible. The result of this comparison may provide a 
valuable tool for the evaluation of advantages gained from 


parallel hardware units. 
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